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INSECTICIDAL TOXINS FROM PHOTORHABDUS 

The invention relates to novel toxins from Photorhabdus luminescens, nucleic acid 
sequences whose expression results in said toxins, and methods of making and methods of 
using the toxins and corresponding nucleic acid sequences to control insects. 

Insect pests are a major cause of crop losses. Solely in the US, about $7.7 billion 
are lost every year due to infestation by various genera of insects. In addition to losses in 
field crops, insect pests are also a burden to vegetable and fruit growers, to producers of 
ornamental flowers, and they are a nuisance to gardeners and home owners. 

Insect pests are mainly controlled by intensive applications of chemical insecticides, 
which are active through inhibition of insect growth, prevention of insect feeding or 
reproduction, or death of the insects. Good insect control can thus be reached, but these 
chemicals can sometimes also affect other, beneficial insects. Another problem resulting 
from the wide use of chemical pesticides is the appearance of resistant insect varieties. 
This has been partially alleviated by various resistance management strategies, but there is 
an increasing need for alternative pest control agents. Biological insect control agents, 
such as Bacillus thuringlensis strains expressing insecticidal toxins like d-endotoxins t have 
also been applied with satisfactory results, offering an alternative or a complement to 
chemical insecticides. Recently, the genes coding for some of these d-endotoxins have 
been isolated and their expression in heterologous hosts have been shown to provide 
another tool for the control of economically important insect pests. In particular, the 
expression of insecticidal toxins in transgenic plants, such as Bacillus thuringiensis d- 
endotoxins, has provided efficient protection against selected insect pests, and transgenic 
plants expressing such toxins have been commercialized, allowing farmers to reduce 
applications of chemical insect control agents. Yet, even in this case, the development of 
resistance remains a possibility and only a few specific insect pests are controllable. 
Consequently, there remains a long-felt but unfulfilled need to discover new and effective 
insect control agents that provide an economic benefit to farmers and that are 
environmentally acceptable. 

The present invention addresses the need for novel insect control agents. 
Particularly needed are control agents that are targeted to economically important insect 
pests and that efficiently control insect strains resistant to existing insect control agents. 
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Furthermore, agents whose application minimizes the burden on the environment are 
desirable. 

In the search of novel insect control agents, certain classes of nematodes from the 
genera Heterorhabdus and Steinemema are of particular interest because of their 
insecticidal properties. They kill insect larvae and their offspring feed in the dead larvae. 
Indeed, the insecticidal activity is due to symbiotic bacteria living in the nematodes. These 
symbiotic bacteria are Photorhabdus in the case of Heterorhabdus and Xenorhabdus in the 
case of Steinemema. 

The present invention is drawn to nucleic acid sequences isolated from Photorhabdus 
luminescens, and sequences substantially similar thereto, whose expression results in 
toxins that are highly toxic to economically important insect pests, particularly insect pests 
that infest plants. The invention is further drawn to the toxins resulting from the expression 
of the nucleic acid sequences, and to compositions and formulations containing the toxins, 
which are capable of inhibiting the ability of insect pests to survive, grow or reproduce, or of 
limiting insect-related damage or loss in crop plants. The invention is further drawn to a 
method of making the toxins and to methods of using the nucleic acid sequences, for 
example in microorganisms to control insects or in transgenic plants to confer insect 
resistance, and to a method of using the toxins, and compositions and formulations 
comprising the toxins, for example applying the toxins or compositions or formulations to 
insect-infested areas, or to prophylactically treat insect-susceptible areas or plants to confer 
protection or resistance to the insects. 

The novel toxins are highly active against insects. For example, a number of 
economically important insect pests, such as the Lepidopterans Plutella xylostella 
(Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Corn 
Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), Manduca 
sexta (Tobacco Hornworm), Spodoptera exigua (Beet Armyworm), and Spodoptera 
frugiperda (Fall Armyworm), as well as the Coleopterans Diabrotica virgifera virgifera 
(Western Corn Rootworm), Diabrotica undecimpunctata howardi (Southern Corn 
Rootworm), and Leptinotarsa decimlineata (Colorado Potato Beetle) can be controlled by 
one or more of the toxins. The toxins can be used in multiple insect control strategies, 
resulting in maximal efficiency with minimal impact on the environment. 

According to one aspect, the present invention provides an isolated nucleic acid 
molecule comprising: (a) a nucleotide sequence substantially similar to a nucleotide 
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sequence selected from the group consisting of: nucleotides 412-1665 of SEQ ID NO:1, 
nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of SEQ ID NO:1, 
nucleotides 15,171-18,035 of SEQ ID NO:11 f and nucleotides 31,393-35,838 of SEQ ID 
NO:11; (b) a nucleotide sequence comprising nucleotides 23,768-31,336 of SEQ ID NO:11; 
or (c) a nucleotide sequence isocoding with the nucleotide sequence of (a) or (b); wherein 
expression of the nucleic acid molecule results in at least one toxin that is active against 
insects. 

In one embodiment of this aspect, the nucleotide sequence is isocoding with a 
nucleotide sequence substantially similar to nucleotides 412-1665 of SEQ ID NO:1, 
nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, or nucleotides 4515-9269 of SEQ ID NO:1. 
Preferably, the nucleotide sequence is substantially similar to nucleotides 412-1665 of SEQ 
ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1 t nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, or nucleotides 4515-9269 of SEQ ID NO:1. More 
preferably, the nucleotide sequence encodes an amino acid sequence selected from the 
group consisting of SEQ ID NOs:2-6. Most preferably, the nucleotide sequence comprises 
nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1 t nucleotides 
2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1, or nucleotides 4515- 
9269 of SEQ ID NO:1. 

In another embodiment of this aspect, the nucleotide sequence is isocoding with a 
nucleotide sequence substantially similar to nucleotides 15,171-18,035 of SEQ ID NO:11. 
Preferably, the nucleotide sequence is substantially similar to nucleotides 15,171-18,035 of 
SEQ ID NO:11. More preferably, the nucleotide sequence encodes the amino acid 
sequence set forth in SEQ ID NO:12. Most preferably, the nucleotide sequence comprises 
nucleotides 15,171-18,035 of SEQ ID NO:11. 

In still another embodiment of this aspect, the nucleotide sequence is isocoding with 
a nucleotide sequence substantially similar to nucleotides 31 ,393-35,838 of SEQ ID NO:1 1 . 
Preferably, the nucleotide sequence is substantially similar to nucleotides 31,393-35,838 of 
SEQ ID NO:11. More preferably, the nucleotide sequence encodes the amino acid 
sequence set forth in SEQ ID NO: 14. Most preferably, the nucleotide sequence comprises 
nucleotides 31 ,393-35,838 of SEQ ID NO:1 1 . 



WO 99/42589 



-4- 



PCT/EP99/01015 



In yet another embodiment of this aspect, the nucleotide sequence encodes the 
amino acid sequence set forth in SEQ ID NO:13, and preferably comprises nucleotides 
23,768-31 1 336 of SEQ ID NO:1 1 . 

In one embodiment, the nucleotide sequence of the invention comprises the 
approximately 9.7 kb DNA fragment harbored in E coli strain DH5a, designated as NRRL 
accession number B-21835. 

In another embodiment, the nucleotide sequence of the invention comprises the 
approximately 38 kb DNA fragment harbored in E. coli strain DH5a, designated as NRRL 
accession number B-30077. 

In still another embodiment, the nucleotide sequence of the invention comprises the 
approximately 22.2 kb DNA fragment harbored in E coli strain DH5a, designated as NRRL 
accession number B-30078. 

According to one embodiment of the invention, the toxins resulting from expression of 
the nucleic acid molecules of the invention have activity against Lepidopteran insects. 
Preferably, according to this embodiment, the toxins have activity against Plutella xylostella 
(Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Corn 
Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), 
Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall Armyworm). 

According to another embodiment of the invention, the toxins resulting from 
expression of the nucleic acid molecule of the invention have activity against Lepidopteran 
and Coleopteran insects. Preferably, according to this embodiment, the toxins have 
insecticidal activity against Plutella xylostella (Diamondback Moth), Ostrinia nubilalis 
(European Com Borer), and Manduca sexta (Tobacco Hornworrn), Diabrotica virgifera 
virgifera (Western Corn Rootworm), Diabrotica undecimpunctata howardi (Southern Com 
Rootworm), and Leptinotarsa decimlineata (Colorado Potato Beetle). 

in another aspect, the present invention provides an isolated nucleic acid molecule 
comprising a 20 base pair nucleotide portion identical in sequence to a consecutive 20 base 
pair nucleotide portion of a nucleotide sequence selected from the group consisting of: 
nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 
2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515- 
9269 of SEQ ID NO:1, nucleotides 15,171-18,035 of SEQ ID NO:11, and nucleotides 
31,393-35,838 of SEQ ID NO:11, wherein expression of the nucleic acid molecule results in 
at least one toxin that is active against insects. 
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In one embodiment of this aspect, the isolated nucleic acid molecule of the invention 
comprises a 20 base pair nucleotide portion identical in sequence to a consecutive 20 base 
pair nucleotide portion of nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of 
SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID 
NO:1, or nucleotides 4515-9269 of SEQ ID NO:1. 

In another embodiment of this aspect, the isolated nucleic acid molecule of the 
invention comprises a 20 base pair nucleotide portion identical in sequence to a 
consecutive 20 base pair nucleotide portion of nucleotides 15,171-18,035 of SEQ ID 
NO:11. 

In still another embodiment of this aspect, the isolated nucleic acid molecule of the 
invention comprises a 20 base pair nucleotide portion identical in sequence to a 
consecutive 20 base pair nucleotide portion of nucleotides 31,393-35,838 of SEQ ID 
NO:11. 

In a further aspect, the present invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence from Photorhabdus luminescens selected from the group 
consisting of: nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID 
NO:1, nucleotides 2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1. 
nucleotides 4515-9269 of SEQ ID NO:1, nucleotides 66-1898 of SEQ ID NO:11, 
nucleotides 2416-9909 of SEQ ID NO:11, the complement of nucleotides 2817-3395 of 
SEQ ID NO:11, nucleotides 9966-14,633 of SEQ ID NO:11, nucleotides 14,699-15,007 of 
SEQ ID NO:11, nucleotides 15,171-18,035 of SEQ ID NO:11, the complement of 
nucleotides 17,072-17,398 of SEQ ID NO:11, the complement of nucleotides 18,235- 
19,167 of SEQ ID NO:1 1 , the complement of nucleotides 19,385-20,1 16 of SEQ ID NO:1 1 , 
the complement of nucleotides 20,217-20,963 of SEQ ID NO:11, the complement of 
nucleotides 22,172-23,086 of SEQ ID NO:1 1 , nucleotides 23,768-31 ,336 of SEQ ID NO:1 1 , 
nucleotides 31,393-35,838 of SEQ ID NO:11, the complement of nucleotides 35,383- 
35,709 of SEQ ID NO:11, the complement of nucleotides 36,032-36,661 of SEQ ID NO:11, 
and the complement of nucleotides 36,654-37,781 of SEQ ID NO:1 1 . 

The present invention also provides a chimeric gene comprising a heterologous 
promoter sequence operatively linked to the nucleic acid molecule of the invention. Further, 
the present invention provides a recombinant vector comprising such a chimeric gene. Still 
further, the present invention provides a host cell comprising such a chimeric gene. A host 
cell according to this aspect of the invention may be a bacterial cell, a yeast cell, or a plant 
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cell, preferably a plant cell. Even further, the present invention provides a plant comprising 
such a plant cell. Preferably, the plant is maize. 

In yet another aspect, the present invention provides toxins produced by the 
expression of DNA molecules of the present invention. 

According to one embodiment, the toxins of the invention have activity against 
Lepidopteran insects, preferably against Plutella xylostella (Diamondback Moth), 
Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Com Borer), Heliothis 
virescens (Tobacco Budworm), Helicoverpa zea (Corn Earworm), Spodoptera exigua (Beet 
Armyworm), and Spodoptera frugiperda (Fall Armyworm). 

According to another embodiment, the toxins of the invention have activity against 
Lepidopteran and Coleopteran insects, preferably against Plutella xylostella (Diamondback 
Moth), Ostrinia nubilalis (European Com Borer), and Manduca sexta (Tobacco Homworm), 
Diabrotica virgifera virgifera (Western Corn Rootworm), Diabrotica undecimpunctata 
howardi (Southern Com Rootworm), and Leptinotarsa decimlineata (Colorado Potato 
Beetle). 

In one embodiment, the toxins are produced by the £. coli strain designated as 
NRRL accession number B-21 835. 

In another embodiment, the toxins are produced by E. coli strain designated as 
NRRL accession number B-30077. 

In still another embodiment, the toxins are produced by E. coli strain designated as 
NRRL accession number B-30078. 

In one embodiment, a toxin of the invention comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:2-6. 

In another embodiment, a toxin of the invention comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:l2-14. 

The present invention also provides a composition comprising an insecticidally 
effective amount of a toxin according to the invention. 

In another aspect, the present invention provides a method of producing a toxin that is 
active against insects, comprising: (a) obtaining a host cell comprising a chimeric gene, 
which itself comprises a heterologous promoter sequence operatively linked to the nucleic 
acid molecule of the invention; and (b) expressing the nucleic acid molecule in the cell, 
which results in at least one toxin that is active against insects. 
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ln a further aspect, the present invention provides a method of producing an insect- 
resistant plant, comprising introducing a nucleic acid molecule of the invention into the 
plant, wherein the nucleic acid molecule is expressible in the plant in an effective amount to 
control insects. According to one embodiment, the insects are Lepidopteran insects, 
preferably selected from the group consisting of: Plutelia xylostella (Diamondback Moth), 
Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Corn Borer), Heliothis 
virescens (Tobacco Budworm), Helicoverpa zea (Com Earworm), Spodoptera exigua (Beet 
Armyworm), and Spodoptera frugiperda (Fall Armyworm). According to another 
embodiment, the insects are Lepidopteran and Coleopteran insects, preferably selected 
from the group consisting of: Plutelia xylostella (Diamondback Moth), Ostrinia nubilalis 
(European Corn Borer), and Manduca sexta (Tobacco Hornworm), Diabrotica virgifera 
virgifera (Western Corn Rootworm), Diabrotica undecimpunctata howardi (Southern Corn 
Rootworm), and Leptinotarsa. decimlineata (Colorado Potato Beetle). 

In still a further aspect, the present invention provides a method of controlling insects 
comprising delivering to the insects an effective amount of a toxin according to the present 
invention. According to one embodiment, the insects are Lepidopteran insects, preferably 
selected from the group consisting of: Plutelia xylostella (Diamondback Moth), Trichoplusia 
ni (Cabbage Looper), Ostrinia nubilalis (European Corn Borer), Heliothis virescens 
(Tobacco Budworm), Helicoverpa zea (Com Earworm), Spodoptera exigua (Beet 
Armyworm), and Spodoptera frugiperda (Fall Armyworm). According to another 
embodiment, the insects are Lepidopteran and Coleopteran insects, preferably selected 
from the group consisting of: Plutelia xylostella (Diamondback Moth), Ostrinia nubilalis 
(European Com Borer), and Manduca sexta (Tobacco Hornworm), Diabrotica virgifera 
virgifera (Western Com Rootworm), Diabrotica undecimpunctata howardi (Southern Corn 
Rootworm), and Leptinotarsa decimlineata (Colorado Potato Beetle). Preferably, the toxin 
is delivered to the insects orally. 

Yet another aspect of the present invention is the provision of a method for 
mutagenizing a nucleic acid molecule according to the present invention, wherein the 
nucleic acid molecule has been cleaved into population of double-stranded random 
fragments of a desired size, comprising: (a) adding to the population of double-stranded 
random fragments one or more single- or double-stranded oligonucleotides, wherein the 
oligonucleotides each comprise an area of identity and an area of heterology to a double- 
stranded template polynucleotide; (b) denaturing the resultant mixture of double-stranded 
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random fragments and oligonucleotides into single-stranded fragments; (c) incubating the 
resultant population of single-stranded fragments with a polymerase under conditions which 
result in the annealing of the single-stranded fragments at the areas of identity to form pairs 
of annealed fragments, the areas of identity being sufficient for one member of a pair to 
prime replication of the other, thereby forming a mutagenized double-stranded 
polynucleotide; and (d) repeating the second and third steps for at least two further cycles, 
wherein the resultant mixture in the second step of a further cycle includes the mutagenized 
double-stranded polynucleotide from the third step of the previous cycle, and wherein the 
further cycle forms a further mutagenized double-stranded polynucleotide. 

Other aspects and advantages of the present invention will become apparent to those 
skilled in the art from a study of the following description of the invention and non-limiting 
examples. 

DEFINITIONS 

"Activity" of the toxins of the invention is meant that the toxins function as orally 
active insect control agents, have a toxic effect, or are able to disrupt or deter insect 
feeding, which may or may not cause death of the insect. When a toxin of the invention is 
delivered to the insect, the result is typically death of the insect, or the insect does not feed 
upon the source that makes the toxin available to the insect. 

"Associated with / operatively linked" refer to two nucleic acid sequences that are 
related physically or functionally. For example, a promoter or regulatory DNA sequence is 
said to be "associated with" a DNA sequence that codes for an RNA or a protein if the two 
sequences are operatively linked, or situated such that the regulator DNA sequence will 
affect the expression level of the coding or structural DNA sequence. 

A "chimeric gene" is a recombinant nucleic acid sequence in which a promoter or 
regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid 
sequence that codes for an mRNA or which is expressed as a protein, such that the 
regulator nucleic acid sequence is able to regulate transcription or expression of the 
associated nucleic acid sequence. The regulator nucleic acid sequence of the chimeric 
gene is not normally operatively linked to the associated nucleic acid sequence as found in 
nature. 
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A "coding sequence" is a nucleic acid sequence that is transcribed into RNA such as 
mRNA, rRNA, tRNA, snRNA, sense RNA or antisense RNA. Preferably the RNA is then 
translated in an organism to produce a protein. 

To "control" insects means to inhibit, through a toxic effect, the ability of insect pests 
to survive, grow, feed, and/or reproduce, or to limit insect-related damage or loss in crop 
plants. To "control" insects may or may not mean killing the insects, although it preferably 
means killing the insects. 

To "deliver" a toxin means that the toxin comes in contact with an insect, resulting in 
toxic effect and control of the insect. The toxin can be delivered in many recognized ways, 
e.g., orally by ingestion by the insect or by contact with the insect via transgenic plant 
expression, formulated protein composition(s), sprayable protein composition(s), a bait 
matrix, or any other art-recognized toxin delivery system. 

"Expression cassette" as used herein means a nucleic acid sequence capable of 
directing expression of a particular nucleotide sequence in an appropriate host cell, 
comprising a promoter operably linked to the nucleotide sequence of interest which is 
operably linked to termination signals. It also typically comprises sequences required for 
proper translation of the nucleotide sequence. The expression cassette comprising the 
nucleotide sequence of interest may be chimeric, meaning that at least one of its 
components is heterologous with respect to at least one of its other components. The 
expression cassette may also be one which is naturally occurring but has been obtained in 
a recombinant form useful for heterologous expression. Typically, however, the expression 
cassette is heterologous with respect to the host, i.e., the particular nucleic acid sequence 
of the expression cassette does not occur naturally in the host cell and must have been 
introduced into the host cell or an ancestor of the host cell by a transformation event. The 
expression of the nucleotide sequence in the expression cassette may be under the control 
of a constitutive promoter or of an inducible promoter which initiates transcription only when 
the host cell is exposed to some particular external stimulus. In the case of a multicellular 
organism, such as a plant, the promoter can also be specific to a particular tissue, or organ, 
or stage of development. 

A "gene" is a defined region that is located within a genome and that, besides the 
aforementioned coding nucleic acid sequence, comprises other, primarily regulatory, nucleic 
acid sequences responsible for the control of the expression, that is to say the transcription 
and translation, of the coding portion. A gene may also comprise other 5' and 3' 
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untranslated sequences and termination sequences. Further elements that may be present 
are, for example, introns. 

"Gene of interest" refers to any gene which, when transferred to a plant, confers upon 
the plant a desired characteristic such as antibiotic resistance, virus resistance, insect 
resistance, disease resistance, or resistance to other pests, herbicide tolerance, improved 
nutritional value, improved performance in an industrial process or altered reproductive 
capability. The "gene of interest" may also be one that is transferred to plants for the 
production of commercially valuable enzymes or metabolites in the plant. 

A "heterologous" nucleic acid sequence is a nucleic acid sequence not naturally 
associated with a host cell into which it is introduced, including non-naturally occurring 
multiple copies of a naturally occurring nucleic acid sequence. 

A "homologous" nucleic acid sequence is a nucleic acid sequence naturally 
associated with a host cell into which it is introduced. 

"Homologous recombination" is the reciprocal exchange of nucleic acid fragments 
between homologous nucleic acid molecules. 

"Insecticidal" is defined as a toxic biological activity capable of controlling insects, 
preferably by killing them. 

A nucleic acid sequence is "isocoding with" a reference nucleic acid sequence when 
the nucleic acid sequence encodes a polypeptide having the same amino acid sequence as 
the polypeptide encoded by the reference nucleic acid sequence. 

An "isolated" nucleic acid molecule or an isolated enzyme is a nucleic acid molecule 
or enzyme that, by the hand of man, exists apart from its native environment and is 
therefore not a product of nature. An isolated nucleic acid molecule or enzyme may exist in 
a purified form or may exist in a non-native environment such as, for example, a 
recombinant host cell. 

A "nucleic acid molecule" or "nucleic acid sequence" is a linear segment of single- or 
double-stranded DNA or RNA that can be isolated from any source. In the context of the 
present invention, the nucleic acid molecule is preferably a segment of DNA. 

"ORP means open reading frame. 

A "plant" is any plant at any stage of development, particularly a seed plant. 

A "plant cell" is a structural and physiological unit of a plant, comprising a protoplast 
and a cell wall. The plant cell may be in form of an isolated single cell or a cultured cell, or 
as a part of higher organized unit such as, for example, plant tissue, a plant organ, or a 
whole plant. 
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"Plant cell culture" means cultures of plant units such as, for example, protoplasts, 
ceil culture cells, cells in plant tissues, pollen, pollen tubes, ovules, embryo sacs, zygotes 
and embryos at various stages of development. 

"Plant material" refers to leaves, stems, roots, flowers or flower parts, fruits, pollen, 
egg cells, zygotes, seeds, cuttings, cell or tissue cultures, or any other part or product of a 
plant. 

A "plant organ" is a distinct and visibly structured and differentiated part of a plant 
such as a root, stem, leaf, flower bud, or embryo. 

"Plant tissue" as used herein means a group of plant cells organized into a structural 
and functional unit. Any tissue of a plant in plants or in culture is included. This term 
includes, but is not limited to, whole plants, plant organs, plant seeds, tissue culture and 
any groups of plant cells organized into structural and/or functional units. The use of this 
term in conjunction with, or in the absence of, any specific type of plant tissue as listed 
above or otherwise embraced by this definition is not intended to be exclusive of any other 
type of plant tissue. 

A "promoter" is an untranslated DNA sequence upstream of the coding region that 
contains the binding site for RNA polymerase II and initiates transcription of the DNA. The 
promoter region may also include other elements that act as regulators of gene expression. 

A "protoplast" is an isolated plant cell without a cell wall or with only parts of the cell 

wall. 

"Regulatory elements" refer to sequences involved in controlling the expression of a 
nucleotide sequence. Regulatory elements comprise a promoter operably linked to the 
nucleotide sequence of interest and termination signals. They also typically encompass 
sequences required for proper translation of the nucleotide sequence. 

In its broadest sense, the term "substantially similar", when used herein with respect 
to a nucleotide sequence, means a nucleotide sequence corresponding to a reference 
nucleotide sequence, wherein the corresponding sequence encodes a polypeptide having 
substantially the same structure and function as the polypeptide encoded by the reference 
nucleotide sequence, e.g. where only changes in amino acids not affecting the polypeptide 
function occur. Desirably the substantially similar nucleotide sequence encodes the 
polypeptide encoded by the reference nucleotide sequence. The percentage of identity 
between the substantially similar nucleotide sequence and the reference nucleotide 
sequence desirably is at least 80%, more desirably at least 85%, preferably at least 90%, 
more preferably at least 95%, still more preferably at least 99%. A nucleotide sequence 
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"substantially similar" to reference nucleotide sequence hybridizes to the reference 
nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 
50°C with washing in 2X SSC, 0.1% SDS at 50°C, more desirably in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 1X SSC, 0.1% SDS at 
50°C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 
50°C with washing in 0.5X SSC, 0.1% SDS at 50°C, preferably in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 
50°C, more preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPQ 4l 1 mM EDTA at 
50°C with washing in 0.1 X SSC, 0.1% SDS at 65°C. 

"Synthetic" refers to a nucleotide sequence comprising structural characters that are 
not present in the natural sequence. For example, an artificial sequence that resembles 
more closely the G+C content and the normal codon distribution of dicot and/or monocot 
genes is said to be synthetic. 

Transformation" is a process for introducing heterologous nucleic acid into a host 
cell or organism. In particular, "transformation" means the stable integration of a DNA 
molecule into the genome of an organism of interest. 

'Transformed / transgenic / recombinanf refer to a host organism such as a 
bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. 
The nucleic acid molecule can be stably integrated into the genome of the host or the 
nucleic acid molecule can also be present as an extrachromosomal molecule. Such an 
extrachromosomal molecule can be auto-replicating. Transformed cells, tissues, or plants 
are understood to encompass not only the end product of a transformation process, but 
also transgenic progeny thereof. A "non-transformed", "non-transgenic", or "non- 
recombinanf host refers to a wild-type organism, e.g., a bacterium or plant, which does not 
contain the heterologous nucleic acid molecule. 

Nucleotides are indicated by their bases by the following standard abbreviations: 
adenine (A), cytosine (C), thymine (T), and guanine (G). Amino acids are likewise indicated 
by the following standard abbreviations: alanine (Ala; A), arginine (Arg; R), asparagine (Asn; 
N), aspartic acid (Asp; D), cysteine (Cys; C), glutamine (Gin; Q), glutamic acid (Glu; E) f 
glycine (Gly; G), histidine (His; H), isoleucine (He; I), leucine (Leu; L), lysine (Lys; K), 
methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine 
(Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V). Furthermore, (Xaa; X) 
represents any amino acid. 
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BRIEF DESCRIPTION OF THE SEQUENCES IN THE SEQUENCE LISTING 

SEQ ID NO:1 is the sequence of the approximately 9.7 kb DNA fragment comprised in 
pCIB9359-7 which comprises the following ORFs at the specified nucleotide positions: 



Name 


Start 


End 


Orf1 


412 


1665 


orf2 


1686 


2447 


orf3 


2758 


3318 


orf4 


3342 


4118 


orf5 


4515 


9269 



SEQ ID NO:2 is the sequence of the -46.4 kDa protein encoded by orf 1 of SEQ ID NO:1 . 
SEQ ID NO:3 is the sequence of the -28.1 kDa protein encoded by orf2 of SEQ ID NO:1. 
SEQ ID NO:4 is the sequence of the -20.7 kDa protein encoded by orf3 of SEQ ID NO:1 . 
SEQ ID NO:5 is the sequence of the -28.7 kDa protein encoded by orf4 of SEQ ID NO:1. 
SEQ ID NO:6 is the sequence of the -176 kDa protein encoded by orf5 of SEQ ID NO:1 . 
SEQ ID NOs:7-10 are oligonucleotides. 

SEQ ID NO:11 is the sequence of the approximately 38 kb DNA fragment comprised in 
pNOV2400, which comprises the following ORFs at the specified nucleotide positions 
(descending numbers and "C" indicates that the ORF is on the complementary strand): 



Name 


Start 


End 




ort7 


66 


1898 


(partial sequence) 


hph3 


2416 


9909 




orf 18 


3395 


2817 


C 


orf4 


9966 


14,633 




orf 19 


14,699 


15,007 




orf5 


15,171 


18,035 




orf22 


17,398 


17,072 


C 


orf 10 


19,167 


18,235 


C 


orf 14 


20,116 


19,385 


C 


orf 13 


20.963 


20,217 


C 


orf 11 


23,086 


22,172 


C 


hph2 


23,768 


31,336 




orf2 


31,393 


35,838 
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orf21 35,709 35,383 C 

orf16 36,661 36,032 C 

orf8 37,781 36,654 C 
SEQ ID NO: 11 also includes the following restriction sites, some of which are used in the 
subcloning steps set forth in Example 17: 



neSTrlciion one 


INUUIOVMIUG rUOlUUIMgl 


ArriW 


2835 


D nm U| 

oamni 


18 915 


PcmRl 
Do/ l to l 


1 1 ,350 


QeH 1fY7l 
oSlx 1 U / I 




Eag\ 


13,590; 31,481 


Eco721 


34,474 


MM 


2444; 5116; 9327; 26,204 


Noti 


13,589 


Pad 


9915; 23,353; 37,888 


Pvu\ 


8816 


Sap\ 


35,248 


SexAl 


28,946 


Sgfi 


8815 


Spel 


2157; 3769; 7831; 11,168 


Sph\ 


755 


Stu\ 


35,690 




21,443 



SEQ ID N0:12 is the sequence of the protein encoded by orf5 of SEQ ID NO:1 1 . 
SEQ ID NO:13 is the sequence of the protein encoded by hph2 of SEQ ID NO:11 . 
SEQ ID N0.14 is the sequence of the protein encoded by orf2 of SEQ ID NO:1 1 . 
SEQ ID NOs:15-22 are oligonucleotides. 

DEPOSITS 

The following material has been deposited with the Agricultural Research Sen/ice, 
Patent Culture Collection (NRRL), 1815 North University Street, Peoria, Illinois 61604, under 
the terms of the Budapest Treaty on the International Recognition of the Deposit of 
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Microorganisms for the Purposes of Patent Procedure. All restrictions on the availability of 
the deposited material will be irrevocably removed upon the granting of a patent. 

done Accession Number Date of Deposit 

pCIB9359-7 NRRL B-21835 September 17, 1997 

pNOV2400 NRRL B-30077 December 3, 1 998 

pNOV1001 NRRLB-30078 December 3, 1998 



Novel Nucleic Acid Sequences whose Expression Results i n Insecticidal Toxins 

This invention relates to nucleic acid sequences whose expression results in novel 
toxins, and to the making and using of the toxins to control insect pests. The nucleic acid 
sequences are derived from Photorhabdus luminescens, a member of the 
Enterobacteriaceae family. P. luminescens is a symbiotic bacterium of nematodes of the 
genus Heterorhabditis. The nematodes colonize insect larva, kill them, and their offspring 
feed on the dead larvae. The insecticidal activity is actually produced by the symbiotic P. 
luminescens bacteria. The inventors are the first to isolate the nucleic acid sequences of 
the present invention from P. luminescens (ATCC strain number 29999). The expression of 
the nucleic acid sequences of the present invention results in toxins that can be used to 
control Lepidopteran insects such as P/ufe//a xylostella (Diamondback Moth), Trichoplusia ni 
(Cabbage Looper), Ostrinia nubilalis (European Corn Borer), Heliothis virescens (Tobacco 
Budworm), Helicoverpa zea (Corn Earworm), Manduca sexta (Tobacco Hornworm), 
Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall Armyworm), as well 
as Coleopteran insects such as Diabrotica virgifera virgifera (Western Corn Rootworm), 
Diabrotica undecimpunctata howardi (Southern Corn Rootworm), Diabrotica longicomis 
barberi (Northern Com Rootworm), and Leptinotarsa decimlineata (Colorado Potato 
Beetle). 

In one preferred embodiment, the invention encompasses an isolated nucleic acid 
molecule comprising a nucleotide sequence substantially similar to the approximately 9.7 kb 
nucleic acid sequence set forth in SEQ ID NO:1 , whose expression results in insect control 
activity (further illustrated in Examples 1-11). Five open reading frames (ORFs) are present 
in the nucleic acid sequence set forth in SEQ ID NO:1, coding for proteins of predicted sizes 
45 kDa, 28 kDa, 21 kDA, 29 kDa, and 176 kDa. The five ORFs are arranged in an operon- 
like structure. When expressed in a heterologous host, the - 9.7 kb DNA fragment from P. 
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luminescens results in insect control activity against Lepidopterans such as Plutella 
xylostella (Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis 
(European Corn Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Corn 
Earworm), Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall 
Armyworm), showing that expression of the ~ 9.7 kb nucleotide sequence set forth in SEQ 
ID NO:1 is necessary and sufficient for such insect control activity. In a preferred 
embodiment, the invention encompasses a DNA molecule, whose expression results in an 
insecticidal toxin, which is deposited in the E. coli strain pCIB9359-7 (NRRL accession 
number B-21 835). 

In another preferred embodiment, the invention encompasses an isolated nucleic 
acid molecule comprising a nucleotide sequence substantially similar to the approximately 
38 kb nucleic acid fragment set forth in SEQ ID NO:1 1 and deposited in the E. coli strain 
pNOV2400 (NRRL accession number B-30077), whose expression results in insect control 
activity (see Examples 12-18). In a more preferred embodiment, the invention encompasses 
an isolated nucleic acid molecule comprising a nucleotide sequence substantially similar to 
the - 22 kb DNA fragment deposited in the £. coli strain pNOV1001 (NRRL accession 
number B-30078), whose expression results in insect control activity. In a most preferred 
embodiment, the invention encompasses isolated nucleic acid molecules comprising 
nucleotide sequences substantially similar to the three ORFs corresponding to nucleotides 
23,768-31,336 (hph2), 31,393-35,838 (orf2), and 15,171-18,035 (orf5) of the DNA fragment 
set forth in SEQ ID NO:1 1 , as well as the proteins encoded thereby. When co-expressed in 
a heterologous host, these three ORFs result in insect control activity against Lepidopterans 
such as Plutella xylostella (Diamondback Moth), Ostrinia nubilalis (European Corn Borer), 
and Manduca sexta (Tobacco Homworm), as well as against Coleopterans such as 
Diabrotica virgifera virgifera (Western Corn Rootworm), Diabrotica undecimpunctata 
howardi (Southern Corn Rootworm), and Leptinotarsa decimlineata (Colorado Potato 
Beetle), showing that co-expression of these three ORFs (hph2, orf2, and orf5) is necessary 
and sufficient for such insect control activity. 

The present invention also encompasses recombinant vectors comprising the nucleic 
acid sequences of this invention. In such vectors, the nucleic acid sequences are preferably 
comprised in expression cassettes comprising regulatory elements for expression of the 
nucleotide sequences in a host cell capable of expressing the nucleotide sequences. Such 
regulatory elements usually comprise promoter and termination signals and preferably also 
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comprise elements allowing efficient translation of polypeptides encoded by the nucleic 
acid sequences of the present invention. Vectors comprising the nucleic acid sequences 
are usually capable of replication in particular host cells, preferably as extrachromosomal 
molecules, and are therefore used to amplify the nucleic acid sequences of this invention in 
the host cells. In one embodiment, host cells for such vectors are microorganisms, such as 
bacteria, in particular E.coli. In another embodiment, host cells for such recombinant vectors 
are endophytes or epiphytes. A preferred host cell for such vectors is a eukaryotic cell, 
such as a yeast, a plant cell, or an insect cell. Plant cells such as maize cells are most 
preferred host cells. In another preferred embodiment, such vectors are viral vectors and 
are used for replication of the nucleotide sequences in particular host cells, e.g. insect cells 
or plant cells. Recombinant vectors are also used for transformation of the nucleotide 
sequences of this invention into host cells, whereby the nucleotide sequences are stably 
integrated into the DNA of such host cells. In one, such host cells are prokaryotic cells. In a 
preferred embodiment, such host cells are eukaryotic cells, such as yeast cells, insect cells, 
or plant cells. In a most preferred embodiment, the host cells are plant cells, such as maize 
cells. 

In preferred embodiments, the insecticidal toxins of the invention each comprise at 
least one polypeptide encoded by a nucleotide sequence of the invention. In another 
preferred embodiment, the insecticidal toxins are produced from a purified strain of P. 
luminescens, such the strain with ATTC accession number 29999. The toxins of the 
present invention have insect control activity when tested against insect pests in bioassays; 
and these properties of the insecticidal toxins are further illustrated in Examples 1-18. The 
insecticidal toxins desribed in the present invention are further characterized in that their 
molecular weights are larger than 6,000, as found by size fractionation experiments. The 
insecticidal toxins retain full insectidical activity after being stored at 4°C for 2 weeks. One 
is also shown to retain its full insecticidal activity after being freeze-dried and stored at 22°C 
for 2 weeks. However, the insecticidal toxins of the invention lose their insecticidal activity 
after incubation for 5 minutes at 1 00°C. 

In further embodiments, the nucleotide sequences of the invention can be modified 
by incorporation of random mutations in a technique known as in-vitro recombination or 
DNA shuffling. This technique is described in Stemmer et al., Nature 370: 389-391 (1994) 
and US Patent 5,605,793, which are incorporated herein by reference. Millions of mutant 
copies of a nucleotide sequence are produced based on an original nucleotide sequence of 
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this invention and variants with improved properties, such as increased insecticidal activity, 
enhanced stability, or different specificity or range of target insect pests are recovered. The 
method encompasses forming a mutagenized double-stranded polynucleotide from a 
template double-stranded polynucleotide comprising a nucleotide sequence of this 
invention, wherein the template double-stranded polynucleotide has been cleaved into 
double-stranded-random fragments of a desired size, and comprises the steps of adding to 
the resultant population of double-stranded random fragments one or more single or 
double-stranded oligonucleotides, wherein said oligonucleotides comprise an area of 
identity and an area of heterology to the double-stranded template polynucleotide; 
denaturing the resultant mixture of double-stranded random fragments and oligonucleotides 
into single-stranded fragments; incubating the resultant population of single-stranded 
fragments with a polymerase under conditions which result in the annealing of said single- 
stranded fragments at said areas of identity to form pairs of annealed fragments, said areas 
of identity being sufficient for one member of a pair to prime replication of the other, thereby 
forming a mutagenized double-stranded polynucleotide; and repeating the second and third 
steps for at least two further cycles, wherein the resultant mixture in the second step of a 
further cycle includes the mutagenized double-stranded polynucleotide from the third step 
of the previous cycle, and the further cycle forms a further mutagenized double-stranded 
polynucleotide. In a preferred embodiment, the concentration of a single species of double- 
stranded random fragment in the population of double-stranded random fragments is less 
than 1% by weight of the total DNA. In a further preferred embodiment, the template 
double-stranded polynucleotide comprises at least about 100 species of polynucleotides. In 
another preferred embodiment, the size of the double-stranded random fragments is from 
about 5 bp to 5 kb. In a further preferred embodiment, the fourth step of the method 
comprises repeating the second and the third steps for at least 10 cycles. 

Expression of the Nucleotide Sequences in Heterologous Microbial Hosts 

As biological insect control agents, the insecticidal toxins are produced by expression 
of the nucleotide sequences in heterologous host cells capable of expressing the nucleotide 
sequences. In a first embodiment, P. luminescens cells comprising modifications of at least 
one nucleotide sequence of this invention at its chromosomal location are described. Such 
modifications encompass mutations or deletions of existing regulatory elements, thus 
leading to altered expression of the nucleotide sequence, or the incorporation of new 
regulatory elements controlling the expression of the nucleotide sequence. In another 



WO 99/42589 



-19- 



PCT/EP99/01015 



embodiment, additional copies of one or more of the nucleotide sequences are added to P. 
luminescens cells either by insertion into the chromosome or by introduction of 
extrachromosomally replicating molecules containing the nucleotide sequences. 

In another embodiment, at least one of the nucleotide sequences of the invention is 
inserted into an appropriate expression cassette, comprising a promoter and termination 
signals. Expression of the nucleotide sequence is constitutive, or an inducible promoter 
responding to various types of stimuli to initiate transcription is used. In a preferred 
embodiment, the cell in which the toxin is expressed is a microorganism, such as a virus, a 
bacteria, or a fungus. In a preferred embodiment, a virus, such as a baculovirus, contains a 
nucleotide sequence of the invention in its genome and expresses large amounts of the 
corresponding insecticidal toxin after infection of appropriate eukaryotic cells that are 
suitable for virus replication and expression of the nucleotide sequence. The insecticidal 
toxin thus produced is used as an insecticidal agent. Alternatively, baculoviruses 
engineered to include the nucleotide sequence are used to infect insects in-vivo and kill 
them either by expression of the insecticidal toxin or by a combination of viral infection and 
expression of the insecticidal toxin. 

Bacterial ceils are also hosts for the expression of the nucleotide sequences of the 
invention. In a preferred embodiment, non-pathogenic symbiotic bacteria, which are able to 
live and replicate within plant tissues, so-called endophytes, or non-pathogenic symbiotic 
bacteria, which are capable of colonizing the phyllosphere or the rhizosphere, so-called 
epiphytes, are used. Such bacteria include bacteria of the genera Agrobacterium, 
Alcaligenes, Azospirillum, Azotobacter, Bacillus, Clavibacter, Enterobacter, Erwinia, 
Flavobacter, Klebsiella, Pseudomonas, Rhizobium, Serratia, Streptomyces and 
Xanthomonas. Symbiotic fungi, such as Trichoderma and Gliocladium are also possible 
hosts for expression of the inventive nucleotide sequences for the same purpose. 

Techniques for these genetic manipulations are specific for the different available 
hosts and are known in the art. For example, the expression vectors pKK223-3 and 
pKK223-2 can be used to express heterologous genes in E. coli t either in transcriptional or 
translational fusion, behind the tac or trc promoter. For the expression of operons encoding 
multiple ORFs, the simplest procedure is to insert the operon into a vector such as pKK223- 
3 in transcriptional fusion, allowing the cognate ribosome binding site of the heterologous 
genes to be used. Techniques for overexpression in gram-positive species such as Bacillus 
are also known in the art and can be used in the context of this invention (Quax et al. In.: 
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Industrial Microorganisms: Basic and Applied Molecular Genetics, Eds. Baltz et a/., 
American Society for Microbiology, Washington (1993)). Alternate systems for 
overexpression rely for example, on yeast vectors and include the use of Pichia, 
Saccharomyces and Kluyveromyces (Sreekrishna, In: Industrial microorganisms: basic and 
applied molecular genetics, Baltz, Hegeman, and Skatrud eds., American Society for 
Microbiology, Washington (1993); Dequin & Barre, Biotechnology 12:173-177 (1994); van 
den Berg et a/., Biotechnology 8:135-139 (1990)). 

In another preferred embodiment, at least one of the described nucleotide 
sequences is transferred to and expressed in Pseudomonas fluorescens strain CGA267356 
(described in the published application EU 0 472 494 and in WO 94/01561) which has 
biocontrol characteristics. In another preferred embodiment, a nucleotide sequence of the 
invention is transferred to Pseudomonas aureofaciens strain 30-84 which also has 
biocontrol characteristics. Expression in heterologous biocontrol strains requires the 
selection of vectors appropriate for replication in the chosen host and a suitable choice of 
promoter. Techniques are well known in the art for expression in gram-negative and gram- 
positive bacteria and fungi. 

Expression of the Nucleotide Sequences in Plant Tissue 

In a particularly preferred embodiment, at least one of the insecticidal toxins of the 
invention is expressed in a higher organism, e.g., a plant. In this case, transgenic plants 
expressing effective amounts of the toxins protect themselves from insect pests. When the 
insect starts feeding on such a transgenic plant, it also ingests the expressed toxins. This 
will deter the insect from further biting into the plant tissue or may even harm or kill the 
insect. A nucleotide sequence of the present invention is inserted into an expression 
cassette, which is then preferably stably integrated in the genome of said plant. In another 
preferred embodiment, the nucleotide sequence is included in a non-pathogenic self- 
replicating virus. Plants transformed in accordance with the present invention may be 
monocots or dicots and include, but are not limited to, maize, wheat, barley, rye, sweet 
potato, bean, pea, chicory, lettuce, cabbage, cauliflower, broccoli, turnip, radish, spinach, 
asparagus, onion, garlic, pepper, celery, squash, pumpkin, hemp, zucchini, apple, pear, 
quince, melon, plum, cherry, peach, nectarine, apricot, strawberry, grape, raspberry, 
blackberry, pineapple, avocado, papaya, mango, banana, soybean, tomato, sorghum, 
sugarcane, sugarbeet, sunflower, rapeseed, clover, tobacco, carrot, cotton, alfalfa, rice, 
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potato, eggplant, cucumber, Arabidopsis, and woody plants such as coniferous and 
deciduous trees. 

Once a desired nucleotide sequence has been transformed into a particular plant 
species, it may be propagated in that species or moved into other varieties of the same 
species, particularly including commercial varieties, using traditional breeding techniques. 

A nucleotide sequence of this invention is preferably expressed in transgenic plants, 
thus causing the biosynthesis of the corresponding toxin in the transgenic plants. In this 
way, transgenic plants with enhanced resistance to insects are generated. For their 
expression in transgenic plants, the nucleotide sequences of the invention may require 
modification and optimization. Although in many cases genes from microbial organisms can 
be expressed in plants at high levels without modification, low expression in transgenic 
plants may result from microbial nucleotide sequences having codons that are not preferred 
in plants. It is known in the art that all organisms have specific preferences for codon 
usage, and the codons of the nucleotide sequences described in this invention can be 
changed to conform with plant preferences, while maintaining the amino acids encoded 
thereby. Furthermore, high expression in plants is best achieved from coding sequences 
that have at least 35% about GC content, preferably more than about 45%, more preferably 
more than about 50%, and most preferably more than about 60%. Microbial nucleotide 
sequences which have low GC contents may express poorly in plants due to the existence 
of ATTTA motifs which may destabilize messages, and AATAAA motifs which may cause 
inappropriate polyadenylation. Although preferred gene sequences may be adequately 
expressed in both monocotyledonous and dicotyledonous plant species, sequences can be 
modified to account for the specific codon preferences and GC content preferences of 
monocotyledons or dicotyledons as these preferences have been shown to differ (Murray et 
al Nucl. Acids Res. 17: 477-498 (1989)). In addition, the nucleotide sequences are 
screened for the existence of illegitimate splice sites that may cause message truncation. 
All changes required to be made within the nucleotide sequences such as those described 
above are made using well known techniques of site directed mutagenesis, PCR, and 
synthetic gene construction using the methods described in the published patent 
applications EP 0 385 962 (to Monsanto), EP 0 359 472 (to Lubrizol, and WO 93/07278 (to 
Ciba-Geigy). 

For efficient initiation of translation, sequences adjacent to the initiating methionine 
may require modification. For example, they can be modified by the inclusion of sequences 
known to be effective in plants. Joshi has suggested an appropriate consensus for plants 
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(NAR 15: 6643-6653 (1987)) and Clontech suggests a further consensus translation 
initiator (1993/1994 catalog, page 210). These consensuses are suitable for use with the 
nucleotide sequences of this invention. The sequences are incorporated into constructions 
comprising the nucleotide sequences, up to and including the ATG (whilst leaving the 
second amino acid unmodified), or alternatively up to and including the GTC subsequent to 
the ATG (with the possibility of modifying the second amino acid of the transgene). 

Expression of the nucleotide sequences in transgenic plants is driven by promoters 
shown to be functional in plants. The choice of promoter will vary depending on the 
temporal and spatial requirements for expression, and also depending on the target 
species. Thus, expression of the nucleotide sequences of this invention in leaves, in ears, in 
inflorescences {e.g. spikes, panicles, cobs, etc.), in roots, and/or seedlings is preferred. In 
many cases, however, protection against more than one type of insect pest is sought, and 
thus expression in multiple tissues is desirable. Although many promoters from 
dicotyledons have been shown to be operational in monocotyledons and vice versa, ideally 
dicotyledonous promoters are selected for expression in dicotyledons, and 
monocotyledonous promoters for expression in monocotyledons. However, there is no 
restriction to the provenance of selected promoters; it is sufficient that they are operational 
in driving the expression of the nucleotide sequences in the desired cell. 

Preferred promoters that are expressed constitutively include promoters from genes 
encoding actin or ubiquitin and the CaMV 35S and 19S promoters. The nucleotide 
sequences of this invention can also be expressed under the regulation of promoters that 
are chemically regulated. This enables the insecticidal toxins to be synthesized only when 
the crop plants are treated with the inducing chemicals. Preferred technology for chemical 
induction of gene expression is detailed in the published application EP 0 332 104 (to Ciba- 
Geigy) and US patent 5,614,395. A preferred promoter for chemical induction is the 
tobacco PR-1 a promoter. 

A preferred category of promoters is that which is wound inducible. Numerous 
promoters have been described which are expressed at wound sites and also at the sites of 
phytopathogen infection. Ideally, such a promoter should only be active locally at the sites 
of infection, and in this way the insecticidal toxins only accumulate in cells which need to 
synthesize the insecticidal toxins to kill the invading insect pest. Preferred promoters of this 
kind include those described by Stanford etal. Mol. Gen. Genet. 215: 200-208 (1989), Xu et 
a/. Plant Molec. Biol. 22: 573-588 (1993), Logemann et a/. Plant Cell 1: 151-158 (1989), 
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Rohrmeier & Lehle, Plant Molec. Biol. 22: 783-792 (1993), Firek et al. Plant Molec. Biol. 22: 
129-142 (1993), and Warner etal. Plant J. 3: 191-201 (1993). 

Preferred tissue specific expression patterns include green tissue specific, root 
specific, stem specific, and flower specific. Promoters suitable for expression in green 
tissue include many which regulate genes involved in photosynthesis and many of these 
have been cloned from both monocotyledons and dicotyledons. A preferred promoter is the 
maize PEPC promoter from the phosphoenol carboxylase gene (Hudspeth & Grula, Plant 
Molec. Biol. 12: 579-589 (1989)). A preferred promoter for root specific expression is that 
described by de Framond (FEBS 290: 103-106 (1991); EP 0 452 269 to Ciba-Geigy). A 
preferred stem specific promoter is that described in US patent 5,625,136 (to Ciba-Geigy) 
and which drives expression of the maize trpA gene. 

Especially preferred embodiments of the invention are transgenic plants expressing 
at least one of the nucleotide sequences of the invention in a root-preferred or root-specific 
fashion. Further preferred embodiments are transgenic plants expressing the nucleotide 
sequences in a wound-inducible or pathogen infection-inducible manner. 

In addition to the selection of a suitable promoter, constructions for expression of an 
insecticidal toxin in plants require an appropriate, transcription terminator to be attached 
downstream of the heterologous nucleotide sequence. Several such terminators are 
available and known in the art (e.g. tm1 from CaMV, E9 from rbcS). Any available 
terminator known to function in plants can be used in the context of this invention. 

Numerous other sequences can be incorporated into expression cassettes 
described in this invention. These include sequences which have been shown to enhance 
expression such as intron sequences (e.g. from Adh1 and bronzel) and viral leader 
sequences (e.g. from TMV, MCMV and AMV). 

It may be preferable to target expression of the nucleotide sequences of the present 
invention to different cellular localizations in the plant. In some cases, localization in the 
cytosol may be desirable, whereas in other cases, localization in some subcellular organelle 
may be preferred. Subcellular localization of transgene encoded enzymes is undertaken 
using techniques well known in the art. Typically, the DNA encoding the target peptide from 
a known organelle-targeted gene product is manipulated and fused upstream of the 
nucleotide sequence. Many such target sequences are known for the chloroplast and their 
functioning in heterologous constructions has been shown. The expression of the 
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nucleotide sequences of the present invention is also targeted to the endoplasmic reticulum 
or to the vacuoles of the host cells. Techniques to achieve this are well-known in the art. 

Vectors suitable for plant transformation are described elsewhere in this 
specification. For Agrobacterium-medlaled transformation, binary vectors or vectors 
carrying at least one T-DNA border sequence are suitable, whereas for direct gene transfer 
any vector is suitable and linear DNA containing only the construction of interest may be 
preferred. In the case of direct gene transfer, transformation with a single DNA species or 
co-transformation can be used (Schocher et al. Biotechnology 4: 1093-1096 (1986)). For 
both direct gene transfer and Agrobacterium-mediated transfer, transformation is usually 
(but not necessarily) undertaken with a selectable marker which may provide resistance to 
an antibiotic (kanamycin, hygromycin or methotrexate) or a herbicide (basta). The choice of 
selectable marker is not, however, critical to the invention. 

In another preferred embodiment, a nucleotide sequence of the present invention is 
directly transformed into the plastid genome. A major advantage of plastid transformation is 
that plastids are generally capable of expressing bacterial genes without substantial 
modification, and plastids are capable of expressing multiple open reading frames under 
control of a single promoter. Plastid transformation technology is extensively described in 
U.S. Patent Nos. 5,451,513, 5,545,817, and 5,545,818, in PCT application no. WO 
95/16783, and in McBride etal. (1994) Proc. Natl. Acad. Sci. USA 91, 7301-7305. The basic 
technique for chloroplast transformation involves introducing regions of cloned plastid DNA 
flanking a selectable marker together with the gene of interest into a suitable target tissue, 
e.g., using biolistics or protoplast transformation (e.g., calcium chloride or PEG mediated 
transformation). The 1 to 1.5 kb flanking regions, termed targeting sequences, facilitate 
homologous recombination with the plastid genome and thus allow the replacement or 
modification of specific regions of the plastome. Initially, point mutations in the chloroplast 
16S rRNA and rps12 genes conferring resistance to spectinomycin and/or streptomycin are 
utilized as selectable markers for transformation (Svab, Z., Hajdukiewicz, P., and Maliga, P. 
(1990) Proc. Natl. Acad. Sci. USA 87, 8526-8530; Staub, J. M., and Maliga, P. (1992) Plant 
Cell 4, 39-45). This resulted in stable homoplasmic transformants at a frequency of 
approximately one per 100 bombardments of target leaves. The presence of cloning sites 
between these markers allowed creation of a plastid targeting vector for introduction of 
foreign genes (Staub, J.M.. and Maliga, P. (1993) EMBO J. 12, 601-606). Substantial 
increases in transformation frequency are obtained by replacement of the recessive rRNA 
or r-protein antibiotic resistance genes with a dominant selectable marker, the bacterial 
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aadA gene encoding the spectinornycin-detoxifying enzyme aminoglycoside-3'- 
adenyltransferase (Svab, Z., and Maliga, P. (1993) Proc. Natl. Acad. Sci. USA 90, 913-917). 
Previously, this marker had been used successfully for high-frequency transformation of the 
plastid genome of the green alga Chfamydomonas reinhardtii (Goldschmidt-Clermont, M. 
(1991) Nucl. Acids Res. 19: 4083-4089). Other selectable markers useful for plastid 
transformation are known in the art and encompassed within the scope of the invention. 
Typically, approximately 15-20 cell division cycles following transformation are required to 
reach a homoplastidic state. Plastid expression, in which genes are inserted by homologous 
recombination into all of the several thousand copies of the circular plastid genome present 
in each plant cell, takes advantage of the enormous copy number advantage over nuclear- 
expressed genes to permit expression levels that can readily exceed 10% of the total 
soluble plant protein. In a preferred embodiment, a nucleotide sequence of the present 
invention is inserted into a plastid targeting vector and transformed into the plastid genome 
of a desired plant host. Plants homoplastic for plastid genomes containing a nucleotide 
sequence of the present invention are obtained, and are preferentially capable of high 
expression of the nucleotide sequence. 

Formulation of Insecticidal Compositions 

The invention also includes compositions comprising at least one of the insecticidal 
toxins of the present invention. In order to effectively control insect pests such compositions 
preferably contain sufficient amounts of toxin. Such amounts vary depending on the crop to 
be protected, on the particular pest to be targeted, and on the environmental conditions, 
such as humidity, temperature or type of soil. In a preferred embodiment, compositions 
comprising the insecticidal toxins comprise host cells expressing the toxins without 
additional purification. In another preferred embodiment, the cells expressing the 
insecticidal toxins are lyophilized prior to their use as an insecticidal agent. In another 
embodiment, the insecticidal toxins are engineered to be secreted from the host cells. In 
cases where purification of the toxins from the host cells in which they are expressed is 
desired, various degrees of purification of the insecticidal toxins are reached. 

The present invention further embraces the preparation of compositions comprising 
at least one insecticidal toxin of the present invention, which is homogeneously mixed with 
one or more compounds or groups of compounds described herein. The present invention 
also relates to methods of treating plants, which comprise application of the insecticidal 
toxins or compositions containing the insecticidal toxins, to plants. The insecticidal toxins 
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can be applied to the crop area in the form of compositions or plant to be treated, 
simultaneously or in succession, with further compounds. These compounds can be both 
fertilizers or micronutrient donors or other preparations that influence plant growth. They 
can also be selective herbicides, insecticides, fungicides, bactericides, nematicides, 
molluscicides or mixtures of several of these preparations, if desired together with further 
carriers, surfactants or application-promoting adjuvants customarily employed in the art of 
formulation. Suitable carriers and adjuvants can be solid or liquid and correspond to the 
substances ordinarily employed in formulation technology, e.g. natural or regenerated 
mineral substances, solvents, dispersants, wetting agents, tackifiers, binders or fertilizers. 

A preferred method of applying insecticidal toxins of the present invention is by 
spraying to the environment hosting the insect pest like the soil, water, or foliage of plants. 
The number of applications and the rate of application depend on the type and intensity of 
infestation by the insect pest. The insecticidal toxins can also penetrate the plant through 
the roots via the soil (systemic action) by impregnating the locus of the plant with a liquid 
composition, or by applying the compounds in solid form to the soil, e.g. in granular form 
(soil application). The insecticidal toxins may also be applied to seeds (coating) by 
impregnating the seeds either with a liquid formulation containing insecticidal toxins, or 
coating them with a solid formulation. In special cases, further types of application are also 
possible, for example, selective treatment of the plant stems or buds. The insecticidal 
toxins can also be provided as bait located above or below the ground. 

The insecticidal toxins are used in unmodified form or, preferably, together with the 
adjuvants conventionally employed in the art of formulation, and are therefore formulated in 
known manner to emulsifiable concentrates, coatable pastes, directly sprayable or dilutable 
solutions, dilute emulsions, wettable powders, soluble powders, dusts, granulates, and also 
encapsulations, for example, in polymer substances. Like the nature of the compositions, 
the methods of application, such as spraying, atomizing, dusting, scattering or pouring, are 
chosen in accordance with the intended objectives and the prevailing circumstances. 

The formulations, compositions or preparations containing the insecticidal toxins 
and, where appropriate, a solid or liquid adjuvant, are prepared in known manner, for 
example by homogeneously mixing and/or grinding the insecticidal toxins with extenders, 
for example solvents, solid carriers and, where appropriate, surface-active compounds 
(surfactants). 

Suitable solvents include aromatic hydrocarbons, preferably the fractions having 8 to 
12 carbon atoms, for example, xylene mixtures or substituted naphthalenes, phthalates 
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such as dibutyl phthalate or dioctyl phthalate, aliphatic hydrocarbons such as cyclohexane 
or paraffins, alcohols and glycols and their ethers and esters, such as ethanol, ethylene 
glycol monomethyl or monoethyl ether, ketones such as cyclohexanone, strongly polar 
solvents such as N-methyl-2-pyrrolidone, dimethyl sulfoxide or dimethyl formamide, as well 
as epoxidized vegetable oils such as epoxidized coconut oil or soybean oil or water. 

The solid carriers used e.g. for dusts and dispersible powders, are normally natural 
mineral fillers such as calcite, talcum, kaolin, montmorillonite or attapulgite. In order to 
improve the physical properties it is also possible to add highly dispersed silicic acid or 
highly dispersed absorbent polymers. Suitable granulated adsorptive carriers are porous 
types, for example pumice, broken brick, sepiolite or bentonite; and suitable nonsorbent 
carriers are materials such as calcite or sand. In addition, a great number of pregranulated 
materials of inorganic or organic nature can be used, e.g. especially dolomite or pulverized 
plant residues. 

Suitable surface-active compounds are nonionic, cationic and/or anionic surfactants 
having good emulsifying, dispersing and wetting properties. The term "surfactants" will also 
be understood as comprising mixtures of surfactants. Suitable anionic surfactants can be 
both water-soluble soaps and water-soluble synthetic surface-active compounds. 

Suitable soaps are the alkali metal salts, alkaline earth metal salts or unsubstituted 
or substituted ammonium salts of higher fatty acids (chains of 10 to 22 carbon atoms), for 
example the sodium or potassium salts of oleic or stearic acid, or of natural fatty acid 
mixtures which can be obtained for example from coconut oil or tallow oil. The fatty acid 
methyltaurin salts may also be used. 

More frequently, however, so-called synthetic surfactants are used, especially fatty 
sulfonates, fatty sulfates, sulfonated benzimidazole derivatives or alkylarylsulfonates. 

The fatty sulfonates or sulfates are usually in the form of alkali metal salts, alkaline 
earth metal salts or unsubstituted or substituted ammonium salts and have a 8 to 22 carbon 
alkyl radical which also includes the alkyl moiety of alkyl radicals, for example, the sodium 
or calcium salt of lignonsulfonic acid, of dodecylsulfate or of a mixture of fatty alcohol 
sulfates obtained from natural fatty acids. These compounds also comprise the salts of 
sulfuric acid esters and sulfonic acids of fatty alcohol/ethyiene oxide adducts. The 
sulfonated benzimidazole derivatives preferably contain 2 sulfonic acid groups and one fatty 
acid radical containing 8 to 22 carbon atoms. Examples of alkylarylsulfonates are the 
sodium, calcium or triethanolamine salts of dodecylbenzenesulfonic acid, 
dibutylnapthalenesulfonic acid, or of a naphthalenesulfonic acid/formaldehyde 
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condensation product. Also suitable are corresponding phosphates, e.g. salts of the 
phosphoric acid ester of an adduct of p-nonylphenol with 4 to 14 moles of ethylene oxide. 

Non-ionic surfactants are preferably polyglycol ether derivatives of aliphatic or 
cycloaliphatic alcohols, or saturated or unsaturated fatty acids and alkylphenols, said 
derivatives containing 3 to 30 glycol ether groups and 8 to 20 carbon atoms in the (aliphatic) 
hydrocarbon moiety and 6 to 18 carbon atoms in the alkyl moiety of the alkylphenols. 

Further suitable non-ionic surfactants are the water-soluble adducts of polyethylene 
oxide with polypropylene glycol, ethylenediamine propylene glycol and alkylpolypropylene 
glycol containing 1 to 10 carbon atoms in the alkyl chain, which adducts contain 20 to 250 
ethylene glycol ether groups and 10 to 100 propylene glycol ether groups. These 
compounds usually contain 1 to 5 ethylene glycol units per propylene glycol unit. 

Representative examples of non-ionic surfactants are 
nonylphenolpolyethoxyethanols, castor oil polyglycol ethers, polypropylene/polyethylene 
oxide adducts, tributylphenoxypoiyethoxyethanoi, polyethylene glycol and 
octylphenoxyethoxyethanol. Fatty acid esters of polyoxyethylene sorbitan and 
polyoxyethylene sorbitan trioleate are also suitable non-ionic surfactants. 

Cationic surfactants are preferably quaternary ammonium salts which have, as N- 
substituent, at least one C8-C22 alkyl radical and, as further subetituents, lower 
unsubstituted or halogenated alkyl, benzyl or lower hydroxyalkyl radicals. The salts are 
preferably in the form of halides, methylsulfates or ethylsulfates, e.g. 
stearyltrimethylammonium chloride or benzyldi(2-chloroethyl)ethylammonium bromide. 

The surfactants customarily employed in the art of formulation are described, for 
example, in "McCutcheon's Detergents and Emulsifiers Annual," MC Publishing Corp. 
Ringwood, New Jersey, 1979, and Sisely and Wood, "Encyclopedia of Surface Active 
Agents," Chemical Publishing Co., Inc. New York, 1980. 
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EXAMPLES 

The invention will be further described by reference to the following detailed 
examples. These examples are provided for purposes of illustration only, and are not 
intended to be limiting unless otherwise specified. Standard recombinant DNA and 
molecular cloning techniques used here are well known in the art and are described by 
Ausubel (ed.), Current Protocols in Molecular Biology, John Wiley and Sons, Inc. (1994); T. 
Maniatis, E. F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor laboratory, Cold Spring Harbor, NY (1989); and by TJ. Silhavy, M.L. Berman, 
and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1984). 

A. Isolation Of Nucleotide Sequences Whose Expression Results In Toxins Active 
Against Leoidopteran insects 

Example 1 : Construction of Cosmid Library from Photorhabdus luminescens 

Photorhabdus luminescens strain ATCC 29999 is grown in nutrient broth at 25°C for 
three days as described in the ATCC protocol for bioassay. The culture is grown for 24 
hours for DNA isolation. Total DNA is isolated by treating freshly grown cells resuspended 
in 100 mM Tris pH 8, 10 mM EDTA with 2 mg/ml lysozyme for 30 minutes at 37°C. 
Proteinase K is added to a final concentration of 100 mg/ml, SDS is added to a final 
concentration of 0.5% SDS and the sample is incubated at 45°C. After the solution 
becomes clear and viscous, the SDS concentration is raised to 1%, and 300 mM NaCI and 
an equal volume of phenol-chloroform-isoamyl alcohol are added, mixed gently for 5 
minutes and centrifuged at 3K. The phenol-chloroform-isoamyl alcohol extraction is 
repeated twice. The aqueous phase is mixed with 0.7 volumes isopropanol, and the sample 
is centrifuged. The pellet is washed 3 times with 70% ethanol and the nucleic acids are 
gently resuspended in 0.5X TE. 

The DNA is treated with 0.3 units of Sau3A per mg DNA at 37°C for 3.5 minutes in 
100 ml volume containing a total of 6 mg DNA. The reaction is then heated for 30 minutes 
at 65°C to inactivate the enzyme. Then 2 units of Calf Intestinal Alkaline Phosphatase are 
added and incubated for 30 minutes at 37°C. The sample is mixed with an equal volume of 
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phenol-chloroform-isoamyl alcohol and centrifuged. The aqueous phase is removed, 
precipitated with 0.7 volume isopropanol and centrifuged. The supernatant is transferred to 
a fresh tube, precipitated with ethanol, and the nucleic acids are resuspended in 0.5X TE at 
a concentration of 100 hg/ml. 

SuperCos cosmid vector (Stratagene, La Jolla, CA) is prepared as described by the 
supplier utilizing the BamHl cloning site. Prepared SuperCos at 100 hg/ml is ligated with the 
Sau3A digested P.luminescens DNA at a molar ratio of 2:1 in a 5 ml volume overnight at 
6°C. The ligation mixture is packaged using Gigapack XL III (Stratagene), as described by 
the supplier. Packaged phages are used to infect XL-1MR (Stratagene) cells as described 
by the supplier. The cosmid library is plated on L-agar with 50 mg/ml kanamycin and 
incubated 16 hours at 37°C. 500 colonies are patched onto fresh L-kan plates at 50 
colonies per plate. From the other plates the cells are washed off with L broth and mixed 
with 20% glycerol and frozen at -80°C. 

Example 2: Insect Bioassays 

Plutella xylostella bioassays are performed by aliquoting of 50 \i\ of the E. coli culture 
on the solid artificial Plutella xylostella diet (Biever and Boldt, Annals of Entomological 
Society of America, 1971; Shelton et aL, J. Ent Sci. 26:17). 4 ml of the diet is poured into 1 
oz. clear plastic cups (Bioserve product #9051). 5 neonate P. xylostella from a diet adapted 
lab colony are placed in each diet-containing cup and then covered with a white paper lid 
(Bioserve product #9049). 10 larvae are assayed per concentration. Trays of cups are 
placed in an incubator for 3 days at 72°F with a 14:10 (hours) lightdark cycle. Then, the 
number of live larvae in each cup is recorded. Bioassays for other insects are performed as 
described for Plutella xylostella, but using the diet required by the insect to be tested. 

The broth of P. luminescens undiluted and diluted 1:100 gives 100% mortality 
against P. xylostella. The broth of P. luminescens also gives 100% mortality against 
Diabrotica virgifera virgifera. Three clones with activity against P. xylostella and Heliothis 
virescens are obtained after screening 500 £ coli clones by insect bioassay. These cosmid 
clones are given the numbers pCIB9349, pCIB9350, and pCIB9351. 
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Example 3: Isolation of the Nucleotide Sequence Responsible for Insect Control 
Activity from Clones pCIB9349, pCIB9350, and pCIB9351 

The three clones pCIB9349, pC!B9350 and pCIB9351 are found to be overlapping 
cosmids by restriction enzyme mapping. After digestion with Pad, clones pCIB9349 and 
pCIB9351 give two DNA fragments each, and pCIB9350 gives three DNA fragments. Each 
fragment is isolated and is self-ligated. The enzyme Pad does not cut the SuperCos vector; 
therefore, only fragments linked to it are re-isolated. The ligation mixtures are transformed 
into DH5a £ coli cells. Isolated transformed bacterial colonies are grown in L broth with 50 
\ig/rr\\ kanamycin, and plasmid DNA is isolated by using the alkaline miniprep protocol as 
described in Sambrook, et al. DNA is digested with Notl/Pacl and two clones, pCIB9355 
and pCIB9356, are found by bioassay to still contain the insecticidal activity. Clone 
pCIB9355 is digested with Afofl and a 17 kb and a 4 kb DNA fragment are generated. The 
17 kb fragment is isolated and ligated into Bluescript vector previously cut with A/ofl and 
transformed into DH5<x E. coli ceils. The isolated transformed bacterial colonies are grown 
as described and plasmid DNA is isolated by the alkaline miniprep protocol. A clone 
containing the 17 kb insert is named pCIB9359 and tested by bioassay. The results are 
shown in Example 5. 3 |ig of the 17 kb insert is isolated and treated with 0.3 unit of Sau3A 
per fig DNA for 4, 6, and 8 minutes at 37°C, heated at 75°C for 15 minutes. The samples 
are pooled and ligated into pUC1 9 previously cut with BamHI and treated with calf intestinal 
alkaline phosphatase. The ligation is transformed into DH5a cells and plated on L agar with 
Xgal/Amp as described in Sambrook et al. and grown overnight at 37°C. White colonies 
are picked and grown in L broth with 100 ^g/ml and plasmid DNA is isolated as previously 
described. DNA is digested with EcoRI/Hindlll and novel restriction patterns are sequenced. 
Sequencing primers are ordered from Genosys Biotechnologies (Woodlands, TX). 
Sequencing is performed using the dideoxy chain-termination method. Sequencing is 
completed using Applied Biosystems Inc. model 377 automated DNA sequencer (Foster 
City, CA). Sequence is assembled using 3.0 from Gene Codes Corporation (Ann Arbor, 
Ml). 
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Example 4: Subctoning of the 9.7 kb EcoRI/Xbal Fragment From pCIB9359 

pCIB9359 is digested with EcoRI and Xbal and the DNA is run on a 0.8% 
Seaplaque/TBE gel. The 9.7 kb fragment (SEQ ID NO:1) is isolated and ligated into pUC19 
previously digested with EcoRI and Xbal The ligation mixture is transformed into DH5a E. 
coli cells. Transformed bacteria are grown and plasmid DNA is isolated as previously 
described. The vector containing the 9.7 kb fragment in pUC19 is designated pCIB9359-7 
and bioassay results are shown in Example 5. 

Example 5: Bioassay Results for Cosmid Clones pCIB9359 and pCIB9359-7 



Cultures of E. coli strains 9359 and 9359-7 containing clones pCIB9359 and 
pCIB9359-7, respectively, are tested for insecticidal activity against the following insects in 
insect bioassays: 



Insects 


Clones 




PCIB9359 and pCIB9359-7 


Plutella xylostella (Diamondback Moth (DBM)) 


+++ 


Hellothis virescens (Tobacco Budworm (TBW)) 


++ 


Helicoverpa zea (Corn Earworm (CEW)) 


+++ 


Spodoptera exigua (Beet Armyworm (BAW)) 


+ 


Spodoptera frugiperda (Fall Armyworm (FAW)) 


+ 


Trichoplusia ni (Cabbage Looper (CL)) 


+++ 


Ostrinia nubilalis (European Com Borer (ECB)) 


++ 


Manduca sexta (Tobacco Hornworm (THW) 


na 


Diabrotica virgifera (Western Com Rootworm (WCR)) 


na 


Agrotis ipsilon (Black Cutworm (BCW)) 


na 



na = not active 

+ = significant growth inhibition 

++ = >40% mortality, but less than 100% 

+++ = 100% mortality 
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The clones show insecticidal activity against P. xylostella, H. virescens, H. zea, T. ni, 
and O. nubilalis, and significant insect control activity against S. exigua and S. frugiperda. 

Example 6: Identification of Active Region of pCIB9359-7 By Subcioning 

Cultures of E. coii strains containing subclones of pCIB9359-7 are tested for 
insecticidal activity in insect bioassays against P. xylostella. 



Restriction 


Nucleotide Position Relative to 9.7 kb 


Insecticidal Activity Against 


Fragment 


EcoRUXbal fragment (SEQ ID NO:1) 


Plutella xylostella 




from pCIB9539-7 and Size in kb 




EcoRI/Xbal 


1 to 9712 


9.7 kb 


+++ 


EcoRV 


(-912) to 2309 


3.2 kb 


na 


Hindlll 


665 to 5438 


4.7 kb 


na 


Kpnl 


1441 to 8137 


6.9 kb 


na 


Sacl/Xbal 


2677 to 9712 


7.0 kb 


na 



na = not active 

+ = significant growth inhibition 

++ = >40% mortality, but less than 100% 

+++ = 100% mortality 

Example 7: Characterization of pCIB9359-7 Insect Control Activity By Titration 

Dilutions of a culture of E.coli strain 9359-7 containing pCIB9359-7 are tested for 
insecticidal activity in insect bioassays. Dilutions are prepared in a culture of E.coli XL-1 in a 
total volume of 100 \x\ and are transferred to diet cups with 5 insects per cup. The results 
show the percentage (%) of insect mortality. 
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111 93S9-7 Culture 


Px 


Hv 


Hz 


Tn 


100 


100 


72 


48 


100 


50 


100 


84 


68 


92 


25 


100 


52 


32 


100 


12.5 


96 


52 


36 


68 


6.25 


88 


20 


4 


32 


0 


36 


20 


24 


0 



Px= P. xylostella, Hv= H. virescens, Hz= H. zea, Tn = T. ni 

Cultures of £ coli 9359-7 still show substantial insecticidal activity after dilution. 

Example 8: Stability of pCIB9359-7 Activity 

The stability of the toxins is tested after storage for 2 weeks at different 
temperatures and conditions. 300 ml of Luria broth containing 100 (fig/ml ampicillin is 
inoculated with £ coli strain 9359-7 and grown overnight at 37°C. Samples are placed in 
sterile 15 ml screw cap tubes and stored at 22°C and 4°C. Another sample is centrifuged; 
the supernatant is removed, freeze dried and stored at 22°C. The samples are stored under 
these conditions for 2 weeks and then a bioassay is conducted against P. xylostella. The 
freeze dried material is resuspended in the same volume as before. All samples are 
resuspended by vortexing. 



Conditions 


Results 


22°C (2 weeks) 


+++ 


4°C (2 weeks) 


+++ 


Freeze Dried (2 weeks) 


+++ 



na = not active; + = significant growth inhibition; ++ = >40% mortality, but less than 100%; 
+++ = 1 00% mortality 
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This demonstrates that the toxins retain their activity for at least two weeks at 22°C, 
4°C, and freeze-dried t and are therefore very stable. 

Example 9: Size Fraction of pCIB9359-7 Activity 

The approximate sizes of the insecticidal toxins are determined. P. luminescens 
cosmid clones pCIB9359-7 and pUC19 in E. coll host DH5a are grown in media consisting 
of 50% Terrific broth and 50% Luria broth, supplemented with 50 |ig/ml ampicillin. Cultures 
(three tubes of each strain) are inoculated into 3 ml of the above media in culture tubes and 
incubated on a roller wheel overnight at 37°C. Cultures of each strain are combined and 
sonicated using a Branson Model 450 Sonicator, micro tip, for approximately six 10 second 
cycles with cooling on ice between cycles. The sonicates are centrifuged in a Sorvall SS34 
rotor at 6000 RPM for 10 minutes. The resultant supernatants are filtered through a 0.2 p 
filter. The 3 ml fractions of the filtrates are applied to Bio-Rad Econo-Pac 10DG columns 
that have been previously equilibrated with 10 ml of 50mM NaCI, 25 mM Tris base, pH 7.0. 
The flow through collected during sample loading is discarded. The samples are 
fractionated with two subsequent additions of 4 ml each of the NaCI - Tris equilibration 
buffer. The two four ml fractions are saved for testing. The first fraction contains all 
material above about 6,000 mol. wt; the second fraction contains material smaller than 
6,000 moi. wt. A sample of the whole culture broth, the sonicate, and the filtered 
supernatant on the sonicate are tested along with the three fractions from the 10DG column 
for activity on P. xylostella neonates in bioassays. 

The culture, the sonicate, and the filtered supernatant of the sonicate, and the first 
column fraction from the 9359-7 sample are highly active on P. xylostella. The second 
column fraction from 9359-7 is slightly active (some stunting only). No activity is found in 
the third fraction from 9359-7. The sample from DH5-pUC19 does not have any activity. 
This indicates that the molecular weights of the toxins are above 6,000. 

Example 1 0: Heat Inactivitation of pCIB9359-7 Activity 



The heat stability of the toxins is determined. Overnight cultures of the E. coli strain 
pCIB9359-7 are grown in a 50:50 mixture of Luria broth and Terrific broth. Cultures are 
grown at 37°C in culture tubes on a tube roller. A one ml sample of the culture is placed in 
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a 1.5 mi eppendorf tube and placed in a boiling water bath. The sample is removed after 
five minutes and allowed to cool to room temperature. This sample along with an untreated 
portion of the culture is assayed on P. xylostella. 50pl of sample of sample is spread on 
diet, allowed to dry and neonate larvae P. xylostella applied to the surface. The assay is 
incubated for 5 days at room temperature. 

The untreated sample causes 100% mortality. The heat treated sample and a diet 
alone control do not cause any observable mortality, showing the toxins are heat sensitive. 

Example 11: Leaf Dip Bioassay of pCIB9359-7 

Insecticidal activity of the toxins is tested in a leaf dip bioassay. Six leaves 
approximately 2cm in diameter each are cut from seedlings of turnip and placed in a 1oz. 
plastic cup (Jet Plastica) with 4ml-5ml of the resuspended toxin, covered tightly, and shaken 
until thoroughly wetted. The treated leaves are placed in 50mm petri dishes (Gelman 
Sciences) on absorbent pads moistened with 300|il of water. The dish covers are left open 
until the leaf surface appears dry and then placed on tightly so that the leaves do not dry 
out. 

Ten neonate P. xylostella larvae are placed in each petri dish arena. Also, a 
treatment of 0.1% Bond spreader/sticker with no toxin is set up as a control. The arenas 
are monitored daily for signs of drying leaves, and water is added or leaves replaced if 
necessary. After 3 days the leaves and arenas are examined under a dissecting 
microscope, and the number of live larvae in each arena is recorded. 

100% mortality is found for 9359-7 and none in the no-toxin control, showing that the 
toxins are also insecticidal in a leaf dip assay. 

B. isolation Of Nucleic Acid Sequences Whose Expression Results In Toxins Active 
Against Lepidooteran and Coleooteran Insects 

Example 12: Total DNA Isolation from Photorhabdus luminescens 

Photorhabdus luminescens strain ATCC 29999 is grown 14-18 hours in L broth. 
Total DNA is isolated from 1.5 mis of culture resuspended in 0.5% SDS, 100|ig/ml 
proteinase K, TE to a final volume of 600 After a 1 hour incubation at 37°C, 100|il 5M 
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NaCI and 80^1 CTAB/NaCI are added and the culture is incubated at 65°C for 10 minutes. 
An equal volume of chloroform is added; the culture is mixed gently and spun. The 
aqueous phase is extracted once with phenol and once with chloroform. The nucleic acids 
are treated with 10 \ig RNase A for 30 minutes at room temperature. The aqueous phase is 
mixed with 0.6 volumes isopropanol and the sample is centrifuged. The pellet is washed 
once with 70% ethanol and the nucleic acids are gently resuspended in 100-200ul TE. 

Example 13: PCR Amplification of Probes 

Two probes are PCR amplified from Photorhabdus luminescens strain ATCC 29999 
genomic DNA using oligos 5'-ACACAGCAGGTTCGTCAG-3' (SEQ ID NO:7) and 5'- 
GGCAGAAGCACTCAACTC-3* (SEQ ID NO:8) to amplify probe #1 and oligos 5'- 
ATTGATAGCACGCGGCGACC-3' (SEQ ID NO:9) and 5'- 

TTGTAACGTGGAGCCGAACTGG-3' (SEQ ID NO:10) to amplify probe #2. The oligos are 
ordered from Genosys Biotechnologies, Inc. (Texas). Approximately 10-50 ng of genomic 
DNA is used as the template. 0.8]iM of oligos, 200\M of dNTPs, 1X Taq DNA Polymerase 
buffer and 2.5 units of Taq DNA Polymerase are included in the reaction. The reaction 
conditions are as follows: 
94°C - 1 minute 

94°C - 30 seconds / 60°C - 30 seconds / 72°C - 30 seconds (25 cycles) 
72°C - 5 minutes 
4°C - indefinite soak 

The reactions are preferably carried out in a PCR System 9600 (Perkin Elmer) 
thermocycler. 

Example 14: Probing a Photorhabdus luminescens Library 

600 clones from the P. luminescens cosmid library described in Example 1 are 
patched to L-amp plates in duplicate. The colonies are grown overnight then moved to 4°C. 
The colonies are lifted onto Colony/Plaque Screen Hybridization Transfer Membranes 
(Biotechnology Systems NEN Research Products). The membranes are incubated 2-3 
minutes in 0.75ml 0.5N NaOH twice. The membranes are then incubated 2-3 minutes in 
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0.75ml 1.0M Tris-HCl, pH 7.5 twice. The membranes are allowed to dry at room 
temperature. 

Probe #1 and probe #2 described in Example 13 are labeled using the DECAprime II 
Kit as described by the manufacturer (Ambion cat# 1455). Unincorporated nucleotides are 
removed from the labeled probes using Quick Spin Columns as described by the 
manufacturer (Boehringer Mannheim cat #1273973). The labeled probes are measured for 
incorporated radioactivity and the specific activity is 10,000,000 cpm. Membranes are 
prewetted with 2X SSC and hybridized with the probes for 12-16 hours at 65°C. One set of 
colony lifts is hybridized with probe #1 and the other set is hybridized with probe #2. The 
membranes are washed with wash CHURCH solutions 1 and 2 (Church and Gilbert, Proc. 
Natl Acad. Sci. USA 81:1991-1995 (1984)) and exposed to Kodak film. 

Twenty one clones are identified that hybridize to probe #1 and seven clones are 
identified that hybridize to probe #2. The gene in the clones isolated with probe #1 is 
named hph1 and the gene in the clones isolated with probe #2 is named hph2. 

Example 15: Insect Bioassays 

The clones identified in Example 14 are tested for insecticidal activity against the 
following insects in insect bioassays: Diabrotica virgifera virgifera (Western Corn Rootworm 
(WCR)), Diabrotica undecimpunctata howardi (Southern Corn Rootworm (SCR)), Ostrinia 
nubilaiis (European Corn Borer (ECB)), and Piuteila xylostelia (Diamondback Moth (DBM)). 

Diabrotica virgifera virgifera (Western Com Rootworm) and Diabrotica 
undecimpunctata howardi (Southern Corn Rootworm) assays are performed using a diet 
incorporation method. SOO^il of an overnight culture of the cosmid library in XL-1 Blue MR 
cells (Stratagene) is sonicated and then mixed with SOOpJ of diet. Once the diet solidifies, it 
is dispensed in a petri dish and 20 larvae are introduced over the diet. Trays of dishes are 
placed in an incubator for 3-5 days, and percent mortality is recorded at the end of the 
assay period. 

Ostrinia nubilaiis (European Corn Borer) and Piuteila xylostelia (Diamondback Moth) 
assays are performed by a surface treatment method. The diet is poured in the petri dish 
and allowed it to solidify. The E. coli culture of 200 -300^1 volume is dispensed over the diet 
surface and entire diet surface is covered to spread the culture with the help of bacterial 
loop. Once the surface is dry, 10 larvae are introduced over the diet surface. Trays of 
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dishes are placed in an incubator for 3-5 days. The assay with European Com Borer is 
incubated at 30°C in complete darkness; the assay with Diamondback Moth is incubated at 
72°F with a 14:10 (hours) lightrdark cycle. Percent mortality is recorded at the end of the 
assay period. 

Cosmids containing hph2 are identified with a range of activities, including: WCR 
only; SCR only; WCR and SCR; SCR and ECB; WCR, SCR, and ECB; or WCR, SCR, ECB, 
and DBM activity. 

In addition to probing the P. luminescens cosmid library with DNA probes, 600 
clones are screened by Western Com Rootworm bioassay. A clone is identified with activity 
against Western Com Rootworm. This clone hybridizes with probe #2. 

From these bioassays, cosmid 514, having activity against WCR, SCR, ECB, and 
DBM, is selected for sequencing. 

Example 16: Sequencing of Cosmid 514 

Cosmid 514 is sequenced using dye terminator chemistry on an ABI 377 instrument. 
The nucleotide sequence of cosmid 514 is set forth as SEQ ID NO:11. Cosmid 514 is 
designated pNOV2400 and deposited with the NRRL in E. coli DH5a and assigned 
accession no. B-30077. 

Example 17: Subcloning Insecticidal Regions of Cosmid 514 
514a 

An 9011 base pair fragment within cosmid 514 (SEQ ID NO:11) is removed by 
digesting the cosmid with the restriction endonuclease Spel (New England Biolabs 
(Massachusetts), and ligating (T4 DNA Ligase, NEB) the remainder of 514. Subclone 514a 
consists of cosmid 514 DNA from base pairs 1-2157 ligated to base pairs 11,169-37,948. 

H2Q2/DET34 

hph2 and orf2 (SEQ ID NO:11, base pairs 23,768-35,838) are cloned into pET34b 
(Novagen, Wisconsin). Restriction sites are engineered on both ends of each gene to 
facilitate cloning. PCR is used to add the restriction sites to the genes. A SamHI site is on 
the 5' end of hph2 immediately upstream of the ATG of hph2, and a Sad site is added to 
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the 3' end of hph2 immediately following the DNA triplet encoding the stop codon. A 
guanidine is added between the SamHI site and the start codon of hph2 to put the hph2 
gene in frame with the Cellulose Binding Domain tag in pET34b. Orf2 has a Sad site 
upstream of the 56 base pairs between the stop codon of hph2 and the start codon of orf2. 
The 56 base pairs are included in the hph2-orf2 construct to mimic their setup in the 514 
cosmid. 0rf2 has an Xho\ site on the 3' end immediately following the stop codon. The 
oligos used to add the restriction sites to hph2ax\6 orf2 are as follows: 

hph2-A 5M3GGGATCCGATGATTTTAAAAGG-3' (SEQ ID NO:1 5) 
hph2-B 5'-GCGCCATTGATTTGAG-3' (SEQ ID NO:16) 
hph2-C S'-CATTAGAGGTCGAACGTAC-S' (SEQ ID NO:17) 
hph2-D 5'-GAGCGAGCTCTTACTTAATGGTGTAG-3 , (SEQ ID NO:1 8) 
orf2-A3 5'-CAGCGAGCTCCATGCAGAATTCACAGAC-3' (SEQ ID NO:19) 
orf2-B 5'-GGCAATGGCAGCGATAAG-3' (SEQ ID NO:20) 
orf2-C 5'-CATTAACGCAGGAAGAGC-3' (SEQ ID NO:21) 
orf2-D 5'-GACCTCGAGTTACACGAGCGCGTCAG-3' (SEQ ID NO:22) 
The BamH\-Sad 7583 base pair fragment, corresponding to the hph2 gene, and the 
Sad-Xho\ 4502 base pair orf2 (including the 56 base pairs between hph2 and orf2 open 
reading frames), corresponding to orf2 t are ligated with SamHI-Xhoi-digested vector DNA 
pET34b. 

Orf5/pBS (Nof\-BamH\) 
The 5325 base pair Not\-BamH\ fragment of cosmid 514 is cloned into pBS-SK using 
y4fllll-A/ofl (415 bp) and BamH\-AfH\\ (2530 bp) fragments of pBS-SK. 

Q5-H2-Q2 

The 12,031 base pair BamH\-Xho\ fragment of H202/pET34 is cloned into the 8220 
base pair Xho\-BamH\ fragment of Orf5/pBS. 

O51011H2O2 

A 7298 base pair BamH\-Mlu\ fragment from subclone 514a is ligated (T4 DNA 
Ligase, NEB) with 9588 bp Mlu\-Xho\ and 8220 bp Xho\-BamH\ fragments of subclone 05- 
H2-02. The resulting ~ 22 kb subclone 051 01 1H2Q2, which has activity against WCR and 
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ECB, is designated pNOV1001 and deposited with the NRRL in E coli DH5cc and assigned 
accession no. B-30078. 

AKH2Q2 

A 12,074 base pair BarriH\-Avt\\ fragment of H202/pET34 is ligated (T4 DNA Ligase, 
NEB) into pK184 Nhe\-BamH\ fragment (2228 bp), generating a clone containing hph2 and 
orf2 in a p15a origin of replication, kanamycin-resistant vector. 

Example 18: Insecticidal Activity of Subclones 

Bioassays as described above are performed with E. coli cultures that express the 
above subclones, both singly and in combination. Coexpressing AKH202 and Orf5/pBS in 
E. coii, for example in DH5a or HB101, is found to give insecticidal activity against the 
Lepidopterans Plutella xylostella (Diamondback Moth), Ostrinia nubilafis (European Corn 
Borer), and Manduca sexta (Tobacco Homworm), as well as against the Coleopterans 
Diabrotica virgifera virgifera (Western Com Rootworm), Diabrotica undecimpunctata 
howardi (Southern Com Rootworm), and Leptinotarsa decimlineata (Colorado Potato 
Beetle). Thus, coexpression of hph2 (SEQ ID NO:11 t base pairs 23,768-31,336), orf2 
(SEQ ID NO:11, base pairs 31,393-35,838), and orf5 (SEQ ID NO:11, base pairs 15,171- 
18,035) is sufficient to control these insects. In addition, expression of each of these three 
ORFs on separate plasmids gives insect control activity, demonstrating that they do not 
have to be genetically linked to be active, so long as all three gene products are present. 

C. Expression of the Nucleic Acid Sequences of the Inventio n in Heterologous 

Microbial Hosts 

Microorganisms which are suitable for the heterologous expression of the nucleotide 
sequences of the invention are all microorganisms which are capable of colonizing plants or 
. the rhizosphere. As such they will be brought into contact with insect pests. These include 
gram-negative microorganisms such as Pseudomonas, Enterobacter and Serratia, the 
gram-positive microorganism Bacillus and the fungi Trichoderma t Gliocladium, and 
Saccharomyces cerevisiae. Particularly preferred heterologous hosts are Pseudomonas 
fluorescens, Pseudomonas putida, Pseudomonas cepacia, Pseudomonas aureofaciens, 
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Pseudomonas aurantiaca, Enterobacter cloacae, Serratia marscesens, Bacillus subtilis, 
Bacillus cereus, Trichoderma viride, Trichoderma harzianum, Gliocladium wrens, and 
Saccharomyces cerevisiae. 

Example 19: Expression of the Nucleotide Sequences in E. coli and Other Gram- 
Negative Bacteria 

Many genes have been expressed in gram-negative bacteria in a heterologous 
manner. Expression vector pKK223-3 (Pharmacia catalogue # 27-4935-01) allows 
expression in £ coli. This vector has a strong tac promoter (Brosius, J. et at., Proc. Natl. 
Acad. Sci. USA 81) regulated by the lac repressor and induced by IPTG. A number of other 
expression systems have been developed for use in E. coli. The thermoinducible 
expression vector pPL (Pharmacia #27-4946-01) uses a tightly regulated bacteriophage X 
promoter which allows for high level expression of proteins. The lac promoter provides 
another means of expression but the promoter is not expressed at such high levels as the 
tac promoter. With the addition of broad host range replicons to some of these expression 
system vectors, expression of the nucleotide sequence in closely related gram negative- 
bacteria such as Pseudomonas, Enterobacter, Serratia and Erwinia is possible. For 
example, pLRKD211 (Kaiser & Kroos, Proc. Natl. Acad. Sci. USA 81; 5816-5820 (1984)) 
contains the broad host range replicon on T which allows replication in many gram-negative 
bacteria. 

In £ coli, induction by IPTG is required for expression of the tac (i.e. trp-iac) 
promoter. When this same promoter {e.g. on wide-host range plasmid pLRKD211) is 
introduced into Pseudomonas it is constitutively active without induction by IPTG. This trp- 
lac promoter can be placed in front of any gene or operon of interest for expression in 
Pseudomonas or any other closely related bacterium for the purposes of the constitutive 
expression of such a gene. Thus, a nucleotide sequence whose expression results in an 
insecticidal toxin can therefore be placed behind a strong constitutive promoter, transferred 
to a bacterium which has plant or rhizosphere colonizing properties turning this organism to 
an insecticidal agent. Other possible promoters can be used for the constitutive expression 
of the nucleotide sequence in gram-negative bacteria. These include, for example, the 
promoter from the Pseudomonas regulatory genes gafA and lemA (WO 94/01561) and the 
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Pseudomonas savastanoi IAA operon promoter (Gaffney et a/., J. Bacterid. 172: 5593-5601 
(1990). 

Example 20: Expression of the Nucleotide Sequences in Gram-Positive Bacteria 

Heterologous expression of the nucleotides sequence in gram-positive bacteria is 
another means of producing the insecticidal toxins. Expression systems for Bacillus and 
Streptomyces are the best characterized. The promoter for the erythromycin resistance 
gene {ermR) from Streptococcus pneumoniae has been shown to be active in gram-positive 
aerobes and anaerobes and also in Ego// (Trieu-Cuot et a/., Nucl Acids Res 18: 3660 
(1990)). A further antibiotic resistance promoter from the thiostreptone gene has been used 
in Streptomyces cloning vectors (Bibb, Moi Gen Genet 199: 26-36 (1985)). The shuttle 
vector pHT3101 is also appropriate for expression in Bacillus (Lereclus, FEMS Microbiol 
Lett 60: 211-218 (1989)). A significant advantage of this approach is that many gram- 
positive bacteria produce spores which can be used in formulations that produce 
insecticidal agents with a longer shelf life. Bacillus and Streptomyces species are 
aggressive colonizers of soils 

Example 21 : Expression of the Nucleotide Sequences in Fungi 

Trichoderma harzianum and Gliocladium virens have been shown to provide varying 
levels of biocontrol in the field (US 5,165,928 and US 4,996,157, both to Cornell Research 
Foundation). A nucleotide sequence whose expression results in an insecticidal toxin could 
be expressed in such a fungus. This could be accomplished by a number of ways which are 
well known in the art. One is protoplast-mediated transformation of the fungus by PEG or 
electroporation-mediated techniques. Alternatively, particle bombardment can be used to 
transform protoplasts or other fungal cells with the ability to develop into regenerated 
mature structures. The vector pAN7-1, originally developed for Aspergillus transformation 
and now used widely for fungal transformation (Curragh et al. t MycoL Res. 97(3): 313-317 
(1992;; Tooley etai, Curr. Genet 27:55-60 (1992); Punt et a/., Gene 56: 117-124 (1987)) 
is engineered to contain the nucleotide sequence. This plasmid contains the E. coli the 
hygromycin B resistance gene flanked by the Aspergillus nidulans gpd promoter and the 
trpC terminator (Punt et a/., Gene 56: 1 17-124 (1987)). 
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ln a preferred embodiment, the nucleic acid sequences of the invention are 
expressed in the yeast Saccharomyces cerevisiae. Each of the three ORFs of SEQ ID 
NO: 11 (hph2, orf2 and orf5), which together confer insecticidal activity, are cloned into 
individual vectors with the GAL1 inducible promoter and the CYC1 terminator. Each vector 
has ampiciliin resistance and the 2 micron replicon. The vectors differ in their yeast growth 
markers. hph2 is cloned into p424 (TRP1, ATCC 87329), orf2 into p423 (HIS3, ATCC 
87327), and orf5 into p425 (LEU2, ATCC 87331). The three constructs are transformed 
into S. cerevisiae independently and together. The three ORFs are expressed together and 
tested for protein expression and insecticidal activity. 



The nucleic acid sequences described in this application can be incorporated into 
plant cells using conventional recombinant DNA technology. Generally, this involves 
inserting a coding sequence of the invention into an expression system to which the coding 
sequence is heterologous (i.e., not normally present) using standard cloning procedures 
known in the art. The vector contains the necessary elements for the transcription and 
translation of the inserted protein-coding sequences. A large number of vector systems 
known in the art can be used, such as plasmids, bacteriophage viruses and other modified 
viruses. Suitable vectors include, but are not limited to, viral vectors such as lambda vector 
systems A,gtl1, AxjtlO and Charon 4; plasmid vectors such as pBI121 t pBR322, pACYC177, 
pACYC184, pAR series, pKK223-3, pUC8, pUC9, pUC18, pUC19, pLG339, pRK290, 
pKC37, pKC101, pCDNAII; and other similar systems. The components of the expression 
system may also be modified to increase expression. For example, truncated sequences, 
nucleotide substitutions or other modifications may be employed. The expression systems 
described herein can be used to transform virtually any crop plant cell under suitable 
conditions. Transformed cells can be regenerated into whole plants such that the 
nucleotide sequence of the invention confer insect resistance to the transgenic plants. 

Example 22: Modification of Coding Sequences and Adjacent Sequences 



D. Expression of the Nucleotide Sequences in Transgenic Plants 



The nucleotide sequences described in this application can be modified for 
expression in transgenic plant hosts. A host plant expressing the nucleotide sequences and 
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which produces the insecticidal toxins in its cells has enhanced resistance to insect attack 
and is thus better equipped to withstand crop losses associated with such attack. 

The transgenic expression in plants of genes derived from microbial sources may 
require the modification of those genes to achieve and optimize their expression in plants. 
In particular, bacterial ORFs which encode separate enzymes but which are encoded by the 
same transcript in the native microbe are best expressed in plants on separate transcripts. 
To achieve this, each microbial ORF is isolated individually and cloned within a cassette 
which provides a plant promoter sequence at the 5' end of the ORF and a plant 
transcriptional terminator at the 3 1 end of the ORF. The isolated ORF sequence preferably 
includes the initiating ATG codon and the terminating STOP codon but may include 
additional sequence beyond the initiating ATG and the STOP codon. In addition, the ORF 
may be truncated, but still retain the required activity; for particularly long ORFs, truncated 
versions which retain activity may be preferable for expression in transgenic organisms. By 
"plant promoter" and "plant transcriptional terminator" it is intended to mean promoters and 
transcriptional terminators which operate within plant cells. This includes promoters and 
transcription terminators which may be derived from non-plant sources such as viruses (an 
example is the Cauliflower Mosaic Virus). 

In some cases, modification to the ORF coding sequences and adjacent sequence 
is not required. It is sufficient to isolate a fragment containing the ORF of interest and to 
insert it downstream of a plant promoter. For example, Gaffney et a/. (Science 261: 754- 
756 (1993)) have expressed the Pseudomonas nahG gene in transgenic plants under the 
control of the CaMV 35S promoter and the CaMV tml terminator successfully without 
modification of the coding sequence and with x bp of the Pseudomonas gene upstream of 
the ATG still attached, and y bp downstream of the STOP codon still attached to the nahG 
ORF. Preferably as little adjacent microbial sequence should be left attached upstream of 
the ATG and downstream of the STOP codon. In practice, such construction may depend 
on the availability of restriction sites. 

In other cases, the expression of genes derived from microbial sources may provide 
problems in expression. These problems have been well characterized in the art and are 
particularly common with genes derived from certain sources such as Bacillus. These 
problems may apply to the nucleotide sequence of this invention and the modification of 
these genes can be undertaken using techniques now well known in the art. The following 
problems may be encountered: 
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1 . Codon Usage. 

The preferred codon usage in plants differs from the preferred codon usage in 
certain microorganisms. Comparison of the usage of codons within a cloned microbial ORF 
to usage in plant genes (and in particular genes from the target plant) will enable an 
identification of the codons within the ORF which should preferably be changed. Typically 
plant evolution has tended towards a strong preference of the nucleotides C and G in the 
third base position of monocotyledons, whereas dicotyledons often use the nucleotides A or 
T at this position. By modifying a gene to incorporate preferred codon usage for a particular 
target transgenic species, many of the problems described below for GC/AT content and 
illegitimate splicing will be overcome. 

2. GC/AT Content. 

Plant genes typically have a GC content of more than 35%. ORF sequences which 
are rich in A and T nucleotides can cause several problems in plants. Firstly, motifs of 
ATTTA are believed to cause destabilization of messages and are found at the 3* end of 
many short-lived mRNAs. Secondly, the occurrence of polyadenylation signals such as 
AATAAA at inappropriate positions within the message is believed to cause premature 
truncation of transcription. In addition, monocotyledons may recognize AT-rich sequences 
as splice sites (see below). 

3. Sequences Adjacent to the Initiating Methionine. 

Plants differ from microorganisms in that their messages do not possess a defined 
ribosome binding site. Rather, it is believed that ribosomes attach to the 5' end of the 
message and scan for the first available ATG at which to start translation. Nevertheless, it 
is believed that there is a preference for certain nucleotides adjacent to the ATG and that 
expression of microbial genes can be enhanced by the inclusion of a eukaryotic consensus 
translation initiator at the ATG. Clontech (1993/1994 catalog, page 210, incorporated 
herein by reference) have suggested one sequence as a consensus translation initiator for 
the expression of the E. coli uidA gene in plants. Further, Joshi (NAR 15: 6643-6653 
(1987), incorporated herein by reference) has compared many plant sequences adjacent to 
the ATG and suggests another consensus sequence. In situations where difficulties are 
encountered in the expression of microbial ORFs in plants, inclusion of one of these 
sequences at the initiating ATG may improve translation. In such cases the last three 
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nucleotides of the consensus may not be appropriate for inclusion in the modified sequence 
due to their modification of the second AA residue. Preferred sequences adjacent to the 
initiating methionine may differ between different plant species. A survey of 14 maize 
genes located in the GenBank database provided the following results: 



Position Before the Initiating ATG in 14 Maize Genes : 
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This analysis can be done for the desired plant species into which the nucleotide sequence 
is being incorporated, and the sequence adjacent to the ATG modified to incorporate the 
preferred nucleotides. 

4. Removal of Illegitimate Splice Sites. 

Genes cloned from non-plant sources and not optimized for expression in plants 
may also contain motifs which may be recognized in plants as 5' or 3* splice sites, and be 
cleaved, thus generating truncated or deleted messages. These sites can be removed 
using the techniques well known in the art. 

Techniques for the modification of coding sequences and adjacent sequences are 
well known in the art. In cases where the initial expression of a microbial ORF is low and it 
is deemed appropriate to make alterations to the sequence as described above, then the 
construction of synthetic genes can be accomplished according to methods well known in 
the art. These are, for example, described in the published patent disclosures EP 0 385 
962 (to Monsanto), EP 0 359 472 (to Lubrizol) and WO 93/07278 (to Ciba-Geigy), all of 
which are incorporated herein by reference. In most cases it is preferable to assay the 
expression of gene constructions using transient assay protocols (which are well known in 
the art) prior to their transfer to transgenic plants. 
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Example 23: Construction of Plant Expression Cassettes 

Coding sequences intended for expression in transgenic plants are first assembled in 
expression cassettes behind a suitable promoter expressible in plants. The expression 
cassettes may also comprise any further sequences required or selected for the expression 
of the transgene. Such sequences include, but are not restricted to, transcription 
terminators, extraneous sequences to enhance expression such as introns, vital sequences, 
and sequences intended for the targeting of the gene product to specific organelles and cell 
compartments. These expression cassettes can then be easily transferred to the plant 
transformation vectors described below. The following is a description of various 
components of typical expression cassettes. 

1. Promoters 

The selection of the promoter used in expression cassettes will determine the spatial 
and temporal expression pattern of the transgene in the transgenic plant. Selected 
promoters will express transgenes in specific cell types (such as leaf epidermal cells, 
mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves or flowers, 
for example) and the selection will reflect the desired location of accumulation of the gene 
product. Alternatively, the selected promoter may drive expression of the gene under 
various inducing conditions. Promoters vary in their strength, i.e., ability to promote 
transcription. Depending upon the host cell system utilized, any one of a number of suitable 
promoters can be used, including the gene's native promoter. The following are non- 
limiting examples of promoters that may be used in expression cassettes. 

a. Constitutive Expression, the Ubiquitin Promoter: 

Ubiquitin is a gene product known to accumulate in many cell types and its promoter 
has been cloned from several species for use in transgenic plants (e.g. sunflower - Binet et 
a/. Plant Science 79: 87-94 (1991); maize - Christensen et al. Plant Molec. Biol. 12: 619- 
632 (1989); and Arabidopsis - Norris et a/., Plant MoL Biol. 21 :895-906 (1993)). The maize 
ubiquitin promoter has been developed in transgenic monocot systems and its sequence 
and vectors constructed for monocot transformation are disclosed in the patent publication 
EP 0 342 926 (to Lubrizol) which is herein incorporated by reference. Taylor et al. (Plant 
Cell Rep. 12: 491-495 (1993)) describe a vector (pAHC25) that comprises the maize 
ubiquitin promoter and first intron and its high activity in cell suspensions of numerous 
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monocotyledons when introduced via microprojectile bombardment. The Arabidopsis 
ubiquitin promoter is ideal for use with the nucleotide sequences of the present invention. 
The ubiquitin promoter is suitable for gene expression in transgenic plants, both 
monocotyledons and dicotyledons. Suitable vectors are derivatives of pAHC25 or any of 
the transformation vectors described in this application, modified by the introduction of the 
appropriate ubiquitin promoter and/or intron sequences. 

b. Constitutive Expression, the CaMV 35S Promoter: 

Construction of the plasmid pCGN1761 is described in the published patent 
application EP 0 392 225 (Example 23), which is hereby incorporated by reference. 
pCGN1761 contains the "double" CaMV 35S promoter and the tml transcriptional terminator 
with a unique EcoRI site between the promoter and the terminator and has a pUC-type 
backbone. A derivative of pCGN1761 is constructed which has a modified polylinker which 
includes Notl and Xhol sites in addition to the existing EcoRI site. This derivative is 
designated pCGN1761ENX. pCGN1761ENX is useful for the cloning of cDNA sequences 
or coding sequences (including microbial ORF sequences) within its polylinker for the 
purpose of their expression under the control of the 35S promoter in transgenic plants. The 
entire 35S promoter-coding sequence-fm/ terminator cassette of such a construction can be 
excised by Hindlll, Sphl, Sail, and Xbal sites 5' to the promoter and Xbal, BamHI and Bgll 
sites 3' to the terminator for transfer to transformation vectors such as those described 
beiow. Furthermore, the double 35S promoter fragment can be removed by 5' excision with 
Hindlll, Sphl, Sail, Xbal, or Pstl, and 3' excision with any of the polylinker restriction sites 
(EcoRI, Notl or Xhol) for replacement with another promoter. If desired, modifications 
around the cloning sites can be made by the introduction of sequences that may enhance 
translation. This is particularly useful when overexpression is desired. For example, 
pCGN1761ENX may be modified by optimization of the translational initiation site as 
described in Example 37 of U.S. Patent No. 5,639,949, incorporated herein by reference. 

c. Constitutive Expression, the Actin Promoter: 

Several isoforms of actin are known to be expressed in most cell types and 
consequently the actin promoter is a good choice for a constitutive promoter. In particular, 
the promoter from the rice Actl gene has been cloned and characterized (McElroy et al. 
Plant Cell 2: 163-171 (1990)). A 1.3kb fragment of the promoter was found to contain all 
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the regulatory elements required for expression in rice protoplasts. Furthermore, numerous 
expression vectors based on the Actl promoter have been constructed specifically for use in 
monocotyledons (McElroy etal. Mol. Gen. Genet. 231: 150-160 (1991)). These incorporate 
the >Acf/-intron 1 , Adhl 5* flanking sequence and Adhl-\n\ron 1 (from the maize alcohol 
dehydrogenase gene) and sequence from the CaMV 35S promoter. Vectors showing 
highest expression were fusions of 35S and Actl intron or the Actl 5" flanking sequence and 
the Actl intron. Optimization of sequences around the initiating ATG (of the GUS reporter 
gene) also enhanced expression. The promoter expression cassettes described by McElroy 
et al. (MoL Gen. Genet. 231: 150-160 (1991)) can be easily modified for gene expression 
and are particularly suitable for use in monocotyledonous hosts. For example, promoter- 
containing fragments is removed from the McElroy constructions and used to replace the 
double 35S promoter in pCGN1761ENX, which is then available for the insertion of specific 
gene sequences. The fusion genes thus constructed can then be transferred to appropriate 
transformation vectors. In a separate report, the rice Actl promoter with its first intron has 
also been found to direct high expression in cultured barley cells (Chibbar et al. Plant Cell 
Rep. 12: 506-509 (1993)). 

d. Inducible Expression, the PR-1 Promoter: 

The double 35S promoter in pCGN1761ENX may be replaced with any other promoter 
of choice that will result in suitably high expression levels. By way of example, one of the 
chemically regulatable promoters described in U.S. Patent No. 5,614,395 may replace the 
double 35S promoter. The promoter of choice is preferably excised from its source by 
restriction enzymes, but can alternatively be PCR-amplified using primers that carry 
appropriate terminal restriction sites. Should PCR-amplification be undertaken, then the 
promoter should be re-sequenced to check for amplification errors after the cloning of the 
amplified promoter in the target vector. The chemically/pathogen regulatable tobacco PR- 
1a promoter is cleaved from piasmid pCIB1004 (for construction, see example 21 of 
EP 0 332 104, which is hereby incorporated by reference) and transferred to piasmid 
pCGN1761ENX (Uknes et al., 1992). pCIB1004 is cleaved with Ncol and the resultant 3' 
overhang of the linearized fragment is rendered blunt by treatment with T4 DNA 
polymerase. The fragment is then cleaved with Hindlll and the resultant PR-1a promoter- 
containing fragment is gel purified and cloned into pCGN1761ENX from which the double 
35S promoter has been removed. This is done by cleavage with Xhol and blunting with T4 
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polymerase, followed by cleavage with Hindlll and isolation of the larger vector-terminator 
containing fragment into which the pCIB1004 promoter fragment is cloned. This generates 
a pCGN1761ENX derivative with the PR-1a promoter and the tml terminator and an 
intervening potylinker with unique EcoRI and Notl sites. The selected coding sequence can 
be inserted into this vector, and the fusion products [i.e. promoter-gene-terminator) can 
subsequently be transferred to any selected transformation vector, including those 
described infra. Various chemical regulators may be employed to induce expression of the 
selected coding sequence in the plants transformed according to the present invention, 
including the benzothiadiazole, isonicotinic acid, and salicylic acid compounds disclosed in 
U.S. Patent Nos. 5,523,311 and 5,614,395. 

e. Inducible Expression, an Ethanol-lnducible Promoter: 

A promoter inducible by certain alcohols or ketones, such as ethanol, may also be 
used to confer inducible expression of a coding sequence of the present invention. Such a 
promoter is for example the alcA gene promoter from Aspergillus nidulans (Caddick et al. 
(1998) Nat Biotechnol 16:177-180). In A. nidulans, the alcA gene encodes alcohol 
dehydrogenase I, the expression of which is regulated by the AlcR transcription factors in 
presence of the chemical inducer. For the purposes of the present invention, the CAT 
coding sequences in plasmid palcA:CAT comprising a alcA gene promoter sequence fused 
to a minimal 35S promoter (Caddick et al. (1998) Nat Biotechnol 16:177-180) are replaced 
by a coding sequence of the present invention to form an expression cassette having the 
coding sequence under the control of the alcA gene promoter. This is carried out using 
methods well known in the art. 

f. Inducible Expression, a Glucocorticoid-Inducible Promoter: 

Induction of expression of a nucleic acid sequence of the present invention using 
systems based on steroid hormones is also contemplated. For example, a glucocorticoid- 
mediated induction system is used (Aoyama and Chua (1997) The Plant Journal 11: 605- 
612) and gene expression is induced by application of a glucocorticoid, for example a 
synthetic glucocorticoid, preferably dexamethasone, preferably at a concentration ranging 
from 0.1 mM to 1mM, more preferably from 10mM to 100mM. For the purposes of the 
present invention, the luciferase gene sequences are replaced by a nucleic acid sequence 
of the invention to form an expression cassette having a nucleic acid sequence of the 
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invention under the control of six copies of the GAL4 upstream activating sequences fused 
to the 35S minimal promoter. This is carried out using methods well known in the art. The 
trans-acting factor comprises the GAL4 DNA-binding domain (Keegan et al. (1986) Science 
231: 699-704) fused to the transactivating domain of the herpes viral protein VP16- 
(Triezenberg et al. (1988) Genes Devel. 2: 718-729) fused to the hormone-binding domain 
of the rat glucocorticoid receptor (Picard et al. (1988) Cell 54: 1073-1080). The expression 
of the fusion protein is controlled by any promoter suitable for expression in plants known in 
the art or described here. This expression cassette is also comprised in the plant comprising 
a nucleic acid sequence of the invention fused to the 6xGAL4/minimal promoter. Thus, 
tissue- or organ-specificity of the fusion protein is achieved leading to inducible tissue- or 
organ-specificity of the insecticidal toxin. 

g. Root Specific Expression: 

Another pattern of gene expression is root expression. A suitable root promoter is 
described by de Framond (FEBS 290 : 103-106 (1991)) and also in the published patent 
application EP 0 452 269, which is herein incorporated by reference. This promoter is 
transferred to a suitable vector such as pCGN1761ENX for the insertion of a selected gene 
and subsequent transfer of the entire promoter-gene-terminator cassette to a transformation 
vector of interest. 

h. Wound-lnducible Promoters: 

Wound-inducible promoters may also be suitable for gene expression. Numerous 
such promoters have been described (e.g. Xu et al. Plant Molec. Biol. 22: 573-588 (1993), 
Logemann et al. Plant Cell 1: 151-158 (1989), Rohrmeier & Lehle, Plant Molec. Biol. 22: 
783-792 (1993), Firek et al. Plant Molec. Biol. 22: 129-142 (1993), Warner et al. Plant J. 3: 
191-201 (1993)) and all are suitable for use with the instant invention. Logemann et al. 
describe the 5* upstream sequences of the dicotyledonous potato wunl gene. Xu et al. 
show that a wound-inducible promoter from the dicotyledon potato (p/n2) is active in the 
monocotyledon rice. Further, Rohrmeier & Lehle describe the cloning of the maize Wipl 
cDNA which is wound induced and which can be used to isolate the cognate promoter using 
standard techniques. Similar, Firek et al. and Warner et al. have described a wound- 
induced gene from the monocotyledon Asparagus officinalis, which is expressed at local 
wound and pathogen invasion sites. Using cloning techniques well known in the art, these 
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promoters can be transferred to suitable vectors, fused to the genes pertaining to this 
invention, and used to express these genes at the sites of plant wounding. 

i. Pith-Preferred Expression: 

Patent Application WO 93/07278, which is herein incorporated by reference, 
describes the isolation of the maize trpA gene, which is preferentially expressed in pith 
cells. The gene sequence and promoter extending up to -1726 bp from the start of 
transcription are presented. Using standard molecular biological techniques, this promoter, 
or parts thereof, can be transferred to a vector such as pCGN1761 where it can replace the 
35S promoter and be used to drive the expression of a foreign gene in a pith-preferred 
manner. In fact, fragments containing the pith-preferred promoter or parts thereof can be 
transferred to any vector and modified for utility in transgenic plants. 

j. Leaf -Specific Expression: 

A maize gene encoding phosphoenol carboxylase (PEPC) has been described by 
Hudspeth & Grula (Plant Molec Biol 12: 579-589 (1989)). Using standard molecular 
biological techniques the promoter for this gene can be used to drive the expression of any 
gene in a leaf-specific manner in transgenic plants. 

k. Pollen-Specific Expression: 

WO 93/07278 describes the isolation of the maize calcium-dependent protein kinase 
(CDPK) gene which is expressed in pollen cells. The gene sequence and promoter extend 
up to 1400 bp from the start of transcription. Using standard molecular biological 
techniques, this promoter or parts thereof, can be transferred to a vector such as 
pCGN1761 where it can replace the 35S promoter and be used to drive the expression of a 
nucleic acid sequence of the invention in a pollen-specific manner. 

2. Transcriptional Terminators 

A variety of transcriptional terminators are available for use in expression cassettes. 
These are responsible for the termination of transcription beyond the transgene and its 
correct polyadenylation. Appropriate transcriptional terminators are those that are known to 
function in plants and include the CaMV 35S terminator, the tml terminator, the nopaline 
synthase terminator and the pea rbcS E9 terminator. These can be used in both 
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monocotyledons and dicotyledons. In addition, a gene's native transcription terminator may 
be used. 

3. Sequences for the Enhancement or Regulation of Expression 

Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of this 
invention to increase their expression in transgenic plants. 

Various intron sequences have been shown to enhance expression, particularly in 
monocotyledonous cells. For example, the introns of the maize Adhl gene have been found 
to significantly enhance the expression of the wild-type gene under its cognate promoter 
when introduced into maize cells. Intron 1 was found to be particularly effective and 
enhanced expression in fusion constructs with the chloramphenicol acetyltransferase gene 
(Callis et a/., Genes Develop. 1: 1183-1200 (1987)). In the same experimental system, the 
intron from the maize bronzel gene had a similar effect in enhancing expression. Intron 
sequences have been routinely incorporated into plant transformation vectors, typically 
within the non-translated leader. 

A number of non-translated leader sequences derived from viruses are also known to 
enhance expression, and these are particularly effective in dicotyledonous cells. 
Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the "W -sequence"), Maize 
Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be 
effective in enhancing expression {e.g. Gallie et al. Nucl. Acids Res. 15: 8693-8711 (1987); 
Skuzeski et al. Plant Molec. Biol. 15: 65-79 (1990)). 

4. Targeting of the Gene Product Within the Cell 

Various mechanisms for targeting gene products are known to exist in plants and the 
sequences controlling the functioning of these mechanisms have been characterized in 
some detail. For example, the targeting of gene products to the chloroplast is controlled by 
a signal sequence found at the amino terminal end of various proteins which is cleaved 
during chloroplast import to yield the mature protein (e.g. Comai et al. J. Biol. Chem. 263: 
15104-15109 (1988)). These signal sequences can be fused to heterologous gene 
products to effect the import of heterologous products into the chloroplast (van den Broeck, 
et al. Nature 313: 358-363 (1985)). DNA encoding for appropriate signal sequences can be 
isolated from the 5' end of the cDNAs encoding the RUBISCO protein, the CAB protein, the 
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EPSP synthase enzyme, the GS2 protein and many other proteins which are known to be 
chloroplast localized. See a/so, the section entitled "Expression With Chloroplast Targeting" 
in Example 37 of U.S. Patent No. 5,639,949. 

Other gene products are localized to other organelles such as the mitochondrion and 
the peroxisome (e.g. Unger et al Plant Molec. Biol. 13: 411-418 (1989)). The cDNAs 
encoding these products can also be manipulated to effect the targeting of heterologous 
gene products to these organelles. Examples of such sequences are the nuclear-encoded 
ATPases and specific aspartate amino transferase isoforms for mitochondria. Targeting 
cellular protein bodies has been described by Rogers et al (Proc. Natl. Acad. Sci. USA 82: 
6512-6516(1985)). 

In addition, sequences have been characterized which cause the targeting of gene 
products to other cell compartments. Amino terminal sequences are responsible for 
targeting to the ER, the apoplast, and extracellular secretion from aleurone cells (Koehler & 
Ho, Plant Cell 2: 769-783 (1990)). Additionally, amino terminal sequences in conjunction 
with carboxy terminal sequences are responsible for vacuolar targeting of gene products 
(Shinshi etal. Plant Molec. Biol. 14: 357-368 (1990)). 

By the fusion of the appropriate targeting sequences described above to transgene 
sequences of interest it is possible to direct the transgene product to any organelle or cell 
compartment. For chloroplast targeting, for example, the chloroplast signal sequence from 
the RUBISCO gene, the CAB gene, the EPSP synthase gene, or the GS2 gene is fused in 
frame to the amino terminal ATG of the transgene. The signal sequence selected should 
include the known cleavage site, and the fusion constructed should take into account any 
amino acids after the cleavage site which are required for cleavage. In some cases this 
requirement may be fulfilled by the addition of a small number of amino acids between the 
cleavage site and the transgene ATG or, alternatively, replacement of some amino acids 
within the transgene sequence. Fusions constructed for chloroplast import can be tested 
for efficacy of chloroplast uptake by in vitro translation of in vitro transcribed constructions 
followed by in vitro chloroplast uptake using techniques described by Bartlett et al. In: 
Edelmann et al (Eds.) Methods in Chloroplast Molecular Biology, Elsevier pp 108M091 
(1982) and Wasmann et al. Mol. Gen. Genet. 205: 446-453 (1986). These construction 
techniques are well known in the art and are equally applicable to mitochondria and 
peroxisomes. 
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The above-described mechanisms for cellular targeting can be utilized not only in 
conjunction with their cognate promoters, but also in conjunction with heterologous 
promoters so as to effect a specific cell-targeting goal under the transcriptional regulation of 
a promoter that has an expression pattern different to that of the promoter from which the 
targeting signal derives. 

Example 24: Construction of Plant Transformation Vectors 

Numerous transformation vectors available for plant transformation are known to 
those of ordinary skill in the plant transformation arts, and the genes pertinent to this 
invention can be used in conjunction with any such vectors. The selection of vector will 
depend upon the preferred transformation technique and the target species for 
transformation. For certain target species, different antibiotic or herbicide selection markers 
may be preferred. Selection markers used routinely in transformation include the nptll 
gene, which confers resistance to kanamycin and related antibiotics (Messing & Vierra. 
Gene 19: 259-268 (1982); Bevan et aL, Nature 304:184-187 (1983)), the bar gene, which 
confers resistance to the herbicide phosphinothricin (White et al., Nucl. Acids Res 18: 1062 
(1990), Spencer et aL Theor. Appl. Genet 79: 625-631 (1990)), the hph gene, which confers 
resistance to the antibiotic hygromycin (Blochinger & Diggelmann, Mol Cell Biol 4: 2929- 
2931), and the d/ifrgene, which confers resistance to methatrexate (Bourouis et al., EMBO 
J. 2(7) : 1099-1104 (1983)), and the EPSPS gene, which confers resistance to glyphosate 
(U.S. Patent Nos. 4,940,935 and 5,188,642). 

1 . Vectors Suitable for Agrobacterium Transformation 

Many vectors are available for transformation using Agrobacterium tumefaciens. 
These typically carry at least one T-DNA border sequence and include vectors such as 
pBIN19 (Bevan, Nucl. Acids Res. (1984)) and pXYZ. Below, the construction of two typical 
vectors suitable for Agrobacterium transformation is described. 

a. pCIB200andpCIB2001: 

The binary vectors pclB200 and pCIB2001 are used for the construction of 
recombinant vectors for use with Agrobacterium and are constructed in the following 
manner. pTJS75kan is created by Narl digestion of pTJS75 (Schmidhauser & Helinski, J. 



WO 99/42589 



-57- 



PCT/EP99/01015 



Bacteriol. 164 : 446-455 (1985)) allowing excision of the tetracycline-resistance gene, 
followed by insertion of an Accl fragment from pUC4K carrying an NPTII (Messing & Vierra, 
Gene 19: 259-268 (1982): Bevan et aL, Nature 304: 184-187 (1983): McBride et al., Plant 
Molecular Biology 14-' 266-276 (1990)). Xhol linkers are ligated to the EcoRV fragment of 
PCIB7 which contains the left and right T-DNA borders, a plant selectable nos/nptll chimeric 
gene and the pUC polylinker (Rothstein et al., Gene 53: 153-161 (1987)), and the Xhol- 
digested fragment are cloned into Sa//-digested pTJS75kan to create pCIB200 (see also EP 
0 332 104, example 19). pCIB200 contains the following unique polylinker restriction sites: 
EcoRI, Sstl, Kpnl, Bglll, Xbal, and Sail. pCIB2001 is a derivative of pCIB200 created by the 
insertion into the polylinker of additional restriction sites. Unique restriction sites in the 
polylinker of pCIB2001 are EcoRI, Sstl, Kpnl, Bglll, Xbal, Sail, Mlul, Bell, Avrll, Apal, Hpal, 
and Stut. pCIB2001 , in addition to containing these unique restriction sites also has plant 
and bacterial kanamycin selection, left and right T-DNA borders for Agrobacterium-mB6\a\ed 
transformation, the RK2-derived trfA function for mobilization between E coll and other 
hosts, and the Or/Tand OriV functions also from RK2. The pCIB2001 polylinker is suitable 
for the cloning of plant expression cassettes containing their own regulatory signals. 

b. pCIBIO and Hygromycin Selection Derivatives thereof: 

The binary vector pCIBIO contains a gene encoding kanamycin resistance for 
selection in plants and T-DNA right and left border sequences and incorporates sequences 
from the wide host-range plasmid pRK252 allowing it to replicate in both E coii and 
Agrobacterium. Its construction is described by Rothstein et aL (Gene 53: 153-161 (1987)). 
Various derivatives of pCIBIO are constructed which incorporate the gene for hygromycin B 
phosphotransferase described by Gritz et al. (Gene 25: 179-188 (1983)). These derivatives 
enable selection of transgenic plant cells on hygromycin only (pCIB743), or hygromycin and 
kanamycin (pCIB715, pCIB717). 

2. Vectors Suitable for non-Agrobacterium Transformation 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transformation vector and consequently 
vectors lacking these sequences can be utilized in addition to vectors such as the ones 
described above which contain T-DNA sequences. Transformation techniques that do not 
rely on Agrobacterium include transformation via particle bombardment, protoplast uptake 
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(e.g. PEG and electroporation) and microinjection. The choice of vector depends largely on 
the preferred selection for the species being transformed. Below, the construction of typical 
vectors suitable for non-Agrobacterium transformation is described. 

a. pCIB3064: 

pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in 
combination with selection by the herbicide basta (or phosphinothricin). The plasmid 
pCIB246 comprises the CaMV 35S promoter in operational fusion to the E. coll GUS gene 
and the CaMV 35S transcriptional terminator and is described in the PCT published 
application WO 93/07278. The 35S promoter of this vector contains two ATG sequences 5' 
of the start site. These sites are mutated using standard PCR techniques in such a way as 
to remove the ATGs and generate the restriction sites Sspl and Pvull. The new restriction 
sites are 96 and 37 bp away from the unique Sail site and 101 and 42 bp away from the 
actual start site. The resultant derivative of pCIB246 is designated pCIB3025. The GUS 
gene is then excised from pCIB3025 by digestion with Sail and Sac/, the termini rendered 
blunt and religated to generate plasmid pCIB3060. The plasmid pJIT82 is obtained from the 
John Innes Centre, Norwich and the a 400 bp Smal fragment containing the bar gene from 
Streptomyces viridochromogenes is excised and inserted into the Hpal site of pCIB3060 
(Thompson et ai EMBO J 6: 2519-2523 (1987)). This generated pCIB3064, which 
comprises the bar gene under the control of the CaMV 35S promoter and terminator for 
herbicide selection, a gene for ampicillin resistance (for selection in E colt) and a polylinker 
with the unique sites Sphl, Pstl, Hindlll, and BamHI. This vector is suitable for the cloning 
of plant expression cassettes containing their own regulatory signals. 

b. pSOG19and pSOG35: 

pSOG35 is a transformation vector that utilizes the E coll gene dihydrofolate 
reductase (DFR) as a selectable marker conferring resistance to methotrexate. PCR is 
used to amplify the 35S promoter (-800 bp), intron 6 from the maize Adh1 gene (-550 bp) 
and 18 bp of the GUS untranslated leader sequence from pSOG10. A 250-bp fragment 
encoding the E. coli dihydrofolate reductase type II gene is also amplified by PCR and 
these two PCR fragments are assembled with a Sacl-Pstl fragment from pB1221 (Clontech) 
which comprises the pUC19 vector backbone and the nopaline synthase terminator. 
Assembly of these fragments generates pSOG19 which contains the 35S promoter in fusion 



WO 99/42589 



-59- 



PCT/EP99/01015 



with the intron 6 sequence, the GUS leader, the DHFR gene and the nopaiine synthase 
terminator. Replacement of the GUS leader in pSOG19 with the leader sequence from 
Maize Chlorotic Mottle Virus (MCMV) generates the vector pSOG35. pSOG19 and pSOG35 
carry the pUC gene for ampicillin resistance and have Hindlll, Sphl, Pstl and EcoRI sites 
available for the cloning of foreign substances. 

Example 25: Transformation 

Once a nucleic acid sequence of the invention has been cloned into an expression 
system, it is transformed into a plant cell. Methods for transformation and regeneration of 
plants are well known in the art. For example, Ti plasmid vectors have been utilized for the 
delivery of foreign DNA, as well as direct DNA uptake, liposomes, electroporation, micro- 
injection, and microprojectiles. In addition, bacteria from the genus Agrobacterium can be 
utilized to transform plant cells. Below are descriptions of representative techniques for 
transforming both dicotyledonous and monocotyledonous plants. 

1 . Transformation of Dicotyledons 

Transformation techniques for dicotyledons are well known in the art and include 
Agrobacterium-based techniques and techniques that do not require Agrobacterium. Non- 
Agrobacterium techniques involve the uptake of exogenous genetic material directly by 
protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, 
particle bombardment-mediated delivery, or microinjection. Examples of these techniques 
are described by Paszkowski et a/., EMBO J 3: 2717-2722 (1984), Potrykus era/., Mol. Gen. 
Genet. 199: 169-177 (1985), Reich et a/., Biotechnology 4: 1001-1004 (1986), and Klein et 
a/., Nature 327: 70-73 (1987). In each case the transformed cells are regenerated to whole 
plants using standard techniques known in the art. 

Agrobacterium-mediated transformation is a preferred technique for transformation of 
dicotyledons because of its high efficiency of transformation and its broad utility with many 
different species. Agrobacterium transformation typically involves the transfer of the binary 
vector carrying the foreign DNA of interest (e.g. pCIB200 or pCIB2001) to an appropriate 
Agrobacterium strain which may depend of the complement of vir genes carried by the host 
Agrobacterium strain either on a co-resident Ti plasmid or chromosomally (e.g. strain 
CIB542 for pCIB200 and pCIB2001 (Uknes et ai Plant Cell 5: 159-169 (1993)). The 
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transfer of the recombinant binary vector to Agrobacterium is accomplished by a triparental 
mating procedure using E. coli carrying the recombinant binary vector, a helper £. coli strain 
which carries a plasmid such as pRK2013 and which is able to mobilize the recombinant 
binary vector to the target Agrobacterium strain. Alternatively, the recombinant binary 
vector can be transferred to Agrobacterium by DNA transformation (Hofgen & Willmitzer, 
NucL Acids Res. 16: 9877 (1988)). 

Transformation of the target plant species by recombinant Agrobacterium usually 
involves co-cultivation of the Agrobacterium with explants from the plant and follows 
protocols well known in the art. Transformed tissue is regenerated on selectable medium 
carrying the antibiotic or herbicide resistance marker present between the binary plasmid T- 
DNA borders. 

Another approach to transforming plant cells with a gene involves propelling inert or 
biologically active particles at plant tissues and cells. This technique is disclosed in U.S. 
Patent Nos. 4,945,050, 5,036,006, and 5,100,792 all to Sanford et al. Generally, this 
procedure involves propelling inert or biologically active particles at the cells under 
conditions effective to penetrate the outer surface of the cell and afford incorporation within 
the interior thereof. When inert particles are utilized, the vector can be introduced into the 
cell by coating the particles with the vector containing the desired gene. Alternatively, the 
target cell can be surrounded by the vector so that the vector is carried into the cell by the 
wake of the particle. Biologically active particles (e.g., dried yeast cells, dried bacterium or a 
bacteriophage, each containing DNA sought to be introduced) can also be propelled into 
plant cell tissue. 

2. Transformation of Monocotyledons 

Transformation of most monocotyledon species has now also become routine. 
Preferred techniques include direct gene transfer into protoplasts using PEG or 
electroporation techniques, and particle bombardment into callus tissue. Transformations 
can be undertaken with a single DNA species or multiple DNA species {i.e. co- 
transformation) and both these techniques are suitable for use with this invention. Co- 
transformation may have the advantage of avoiding complete vector construction and of 
generating transgenic plants with unlinked loci for the gene of interest and the selectable 
marker, enabling the removal of the selectable marker in subsequent generations, should 
this be regarded desirable. However, a disadvantage of the use of co-transformation is the 
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less than 100% frequency with which separate DNA species are integrated into the genome 
(Schocher etai Biotechnology 4: 1093-1096 (1986)). 

Patent Applications EP 0 292 435, EP 0 392 225, and WO 93/07278 describe 
techniques for the preparation of callus and protoplasts from an elite inbred line of maize, 
transformation of protoplasts using PEG or electroporation, and the regeneration of maize 
plants from transformed protoplasts. Gordon-Kamm et ai (Plant Cell 2: 603-618 (1990)) 
and Fromm et ai (Biotechnology 8: 833-839 (1990)) have published techniques for 
transformation of A188-derived maize line using particle bombardment. Furthermore, 
WO 93/07278 and Koziel etal. (Biotechnology 11; 194-200 (1993)) describe techniques for 
the transformation of elite inbred lines of maize by particle bombardment. This technique 
utilizes immature maize embryos of 1.5-2.5 mm length excised from a maize ear 14-15 days 
after pollination and a PDS-1000He Biolistics device for bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques 
utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been 
described for Japon/ca-types and /nd/ca-types (Zhang et ai Plant Cell Rep 7: 379-384 
(1988); Shimamoto etai Nature 338: 274-277 (1989); Datta etal Biotechnology 8: 736-740 
(1990)). Both types are also routinely transformable using particle bombardment (Christou 
et ai Biotechnology 9: 957-962 (1991)). Furthermore, WO 93/21335 describes techniques 
for the transformation of rice via electroporation. 

Patent Application EP 0 332 581 describes techniques for the generation, 
transformation and regeneration of Pooideae protoplasts. These techniques allow the 
transformation of Dactylis and wheat. Furthermore, wheat transformation has been 
described by Vasil et ai (Biotechnology 10: 667-674 (1992)) using particle bombardment 
into cells of type C long-term regenerate callus, and also by Vasil et ai (Biotechnology H: 
1553-1558 (1993)) and Weeks et ai (Plant Physiol. 102: 1077-1084 (1993)) using particle 
bombardment of immature embryos and immature embryo-derived callus. A preferred 
technique for wheat transformation, however, involves the transformation of wheat by 
particle bombardment of immature embryos and includes either a high sucrose or a high 
maltose step prior to gene delivery. Prior to bombardment, any number of embryos (0.75-1 
mm in length) are plated onto MS medium with 3% sucrose (Murashiga & Skoog, 
Physiologia Plantarum 15: 473-497 (1962)) and 3 mg/l 2,4-D for induction of somatic 
embryos, which is allowed to proceed in the dark. On the chosen day of bombardment, 
embryos are removed from the induction medium and placed onto the osmoticum (i.e. 
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induction medium with sucrose or maltose added at the desired concentration, typically 
15%). The embryos are allowed to plasmolyze for 2-3 h and are then bombarded. Twenty 
embryos per target plate is typical, although not critical. An appropriate gene-carrying 
plasmid (such as pCIB3064 or pSG35) is precipitated onto micrometer size gold particles 
using standard procedures. Each plate of embryos is shot with the DuPont Biolistics® 
helium device using a burst pressure of -1000 psi using a standard 80 mesh screen. After 
bombardment, the embryos are placed back into the dark to recover for about 24 h (still on 
osmoticum). After 24 hrs, the embryos are removed from the osmoticum and placed back 
onto induction medium where they stay for about a month before regeneration. 
Approximately one month later the embryo explants with developing embryogenic callus are 
transferred to regeneration medium (MS + 1 mg/liter NAA, 5 mg/liter GA), further containing 
the appropriate selection agent (10 mg/l basta in the case of pCIB3064 and 2 mg/l 
methotrexate in the case of pSOG35). After approximately one month, developed shoots 
are transferred to larger sterile containers known as "GA7s" which contain half^strength MS, 
2% sucrose, and the same concentration of selection agent. 

Tranformation of monocotyledons using Agrobacterium has also been described. 
See, WO 94/00977 and U.S. Patent No. 5,591,616, both of which are incorporated herein 
by reference. 



Example 26: Breeding 

The plants obtained via tranformation with a nucleic acid sequence of the present 
invention can be any of a wide variety of plant species, including those of monocots and 
dicots; however, the plants used in the method of the invention are preferably selected from 
the list of agronomically important target crops set forth supra. The expression of a gene of 
the present invention in combination with other characteristics important for production and 
quality can be incorporated into plant lines through breeding. Breeding approaches and 
techniques are known in the art. See, for example, Welsh J. R., Fundamentals of Plant 
Genetics and Breeding, John Wiley & Sons, NY (1981); Crop Breeding, Wood D. R. (Ed.) 
American Society of Agronomy Madison, Wisconsin (1983); Mayo O., The Theory of Plant 
Breeding, Second Edition, Clarendon Press, Oxford (1987); Singh, D.P., Breeding for 
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Resistance to Diseases and Insect Pests, Springer-Verlag, NY (1986); and Wricke and 
Weber, Quantitative Genetics and Selection Plant Breeding, Walter de Gruyter and Co., 
Berlin (1986). 

The genetic properties engineered into the transgenic seeds and plants described 
above are passed on by sexual reproduction or vegetative growth and can thus be 
maintained and propagated in progeny plants. Generally said maintenance and propagation 
make use of known agricultural methods developed to fit specific purposes such as tilling, 
sowing or harvesting. Specialized processes such as hydroponics or greenhouse 
technologies can also be applied. As the growing crop is vulnerable to attack and damages 
caused by insects or infections as well as to competition by weed plants, measures are 
undertaken to control weeds, plant diseases, insects, nematodes, and other adverse 
conditions to improve yield. These include mechanical measures such a tillage of the soil or 
removal of weeds and infected plants, as well as the application of agrochemicals such as 
herbicides, fungicides, gametocides, nematicides, growth regulants, ripening agents and 
insecticides. 

Use of the advantageous genetic properties of the transgenic plants and seeds 
according to the invention can further be made in plant breeding, which aims at the 
development of plants with improved properties such as tolerance of pests, herbicides, or 
stress, improved nutritional value, increased yield, or improved structure causing less loss 
from lodging or shattering. The various breeding steps are characterized by well-defined 
human intervention such as selecting the lines to be crossed, directing pollination of the 
parental lines, or selecting appropriate progeny plants. Depending on the desired 
properties, different breeding measures are taken. The relevant techniques are well known 
in the art and include but are not limited to hybridization, inbreeding, backcross breeding, 
multiline breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. 
Hybridization techniques also include the sterilization of plants to yield male or female 
sterile plants by mechanical, chemical, or biochemical means. Cross pollination of a male 
sterile plant with pollen of a different line assures that the genome of the male sterile but 
female fertile plant will uniformly obtain properties of both parental lines. Thus, the 
transgenic seeds and plants according to the invention can be used for the breeding of 
improved plant lines, that for example, increase the effectiveness of conventional methods 
such as herbicide or pestidice treatment or allow one to dispense with said methods due to 
their modified genetic properties. Alternatively new crops with improved stress tolerance can 
be obtained, which, due to their optimized genetic "equipment", yield harvested product of 
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better quality than products that were not able to tolerate comparable adverse 
developmental conditions. 

Example 27: Seed Production 

In seed production, germination quality and uniformity of seeds are essential product 
characteristics, whereas germination quality and uniformity of seeds harvested and sold by 
the farmer is not important. As it is difficult to keep a crop free from other crop and weed 
seeds, to control seedborne diseases, and to produce seed with good germination, fairly 
extensive and well-defined seed production practices have been developed by seed 
producers, who are experienced in the art of growing, conditioning and marketing of pure 
seed. Thus, it is common practice for the farmer to buy certified seed meeting specific 
quality standards instead of using seed harvested from his own crop. Propagation material 
to be used as seeds is customarily treated with a protectant coating comprising herbicides, 
insecticides, fungicides, bactericides, nematicides, molluscicides, or mixtures thereof. 
Customarily used protectant coatings comprise compounds such as captan, carboxin, 
thiram (TMTD*), methalaxyl (Apron®), and pirimiphos-methyl (Actellic*). If desired, these 
compounds are formulated together with further carriers, surfactants or application- 
promoting adjuvants customarily employed in the art of formulation to provide protection 
against damage caused by bacterial, fungal or animal pests. The protectant coatings may 
be applied by impregnating propagation material with a liquid formulation or by coating with 
a combined wet or dry formulation. Other methods of application are also possible such as 
treatment directed at the buds or the fruit. 

It is a further aspect of the present invention to provide new agricultural methods, 
such as the methods examplified above, which are characterized by the use of transgenic 
plants, transgenic plant material, or transgenic seed according to the present invention. 

The seeds may be provided in a bag, container or vessel comprised of a suitable 
packaging material, the bag or container capable of being closed to contain seeds. The 
bag, container or vessel may be designed for either short term or long term storage, or both, 
of the seed. Examples of a suitable packaging material include paper, such as kraft paper, 
rigid or pliable plastic or other polymeric material, glass or metal. Desirably the bag, 
container, or vessel is comprised of a plurality of layers of packaging materials, of the same 
or differing type. In one embodiment the bag, container or vessel is provided so as to 
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exclude or limit water and moisture from contacting the seed. In one example, the bag, 
container or vessel is sealed, for example heat sealed, to prevent water or moisture from 
entering. In another embodiment water absorbent materials are placed between or 
adjacent to packaging material layers. In yet another embodiment the bag, container or 
vessel, or packaging material of which it is comprised is treated to limit, suppress or 
prevent disease, contamination or other adverse affects of storage or transport of the seed. 
An example of such treatment is sterilization, for example by chemical means or by 
exposure to radiation. Comprised by the present invention is a commercial bag comprising 
seed of a transgenic plant comprising a gene of the present invention that is expressed in 
said transformed plant at higher levels than in a wild type plant, together with a suitable 
carrier, together with label instructions for the use thereof for conferring broad spectrum 
disease resistance to plants. 
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BUDAPEST TREATY ON THS INTERNATIONAL RECOGNITION OF THE DEPOSIT 
OF MX CRDORO AKIfiM S FOR THE PURPOSE OT PATENT PROCEDURES 



TO 



Novartis AO 
Novartis Corporation 
3054 Cornwallla Rd. 
Reeearch Triangle Park, 
KC 27709 

NAME AMD ADDRESS OF THE PARTY TO 
THE VIABILIT Y STATEMENT IS ISSUED 



INTERNATIONAL FORK 

VTnfiTTiTTT nTnTfffflnfT 

leeued pursuant to Rule 10.2 by tfaa 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified at tha bottom of thla page 



DEPOSITOR 



Nana i Novartla AO 

Novartis Corporation 
Address i 30S4 Cornwallla Rd. 

Reeearch Triangle Park, 

NO 27709 



XI. IDENTIFICATION OP THE MICROORGANISM 



Depositor *e taxonomic daaignatlon and 
accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY t 
Escherichia coU NRRL B-30077 

Date oft October 2B, 1998 
1 Original. Depoait 

□ ' New Depoait 

□ 1 Repropagation of original Depoait 



ill. 



JJLL 



~VX ABILITY STATEMENT 



Depoait wai found t Viable n Nonviable on October 31, 1998 (Data) 

Intarnationai Depoaitary Authority a preparation vae found viable on Dae. 



8, l99S|Data>* 



DEPOSITOR'S EQUIVALENCY DECLARATION 



Depoaitor detarminad the International Depoaitary Authority preparation waa 

[2 « Equivalent Q 2 Not equivalent to depoait on l~ 4l£ 9 (Date) 

Signature of Depositor ^LcgaU 



tv CONDITIONS UNDER WHICH THE VIABILITY TEST WAS PERFORMED ( Depositors/ Deposit ar^ 


The drifted coifrvr«* tu<xs pu*- info 2mls 
over-nl^K** rhe.rVng , Some o% ir\r^ < 


f/ajuid eu/fwfe too* s-fttoJ<*i °^^°15Cykd 




Naraei Agricultural Raaearch culture 
Collection (NRRL) 
international Depoeltary Authority 

Address t 1815 N. Univerelty Street 

Eggsla, Ulinaifl ciso* p-s.a. 


Signature! s> of pereon<s) having the power 
to represent tha International Depoaitary 
Authority or of authorised offleial(e)t 

1 Basgj — 1 



Indlcata tha of Cho 

Hark Will* ■ «o« th« ■PW— 
in tha *■*•«■« " 

rill in i< Inter— 



dapMic or -non ft now ooooaic haa 

boa. 

10.2 la) 111) and 1111). ratar to tha moat racant viability taat. 
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BUDAPEST TRBATY OH TOT IHTBRKAXXOHAIa 
MOOOH1TXOH OF THB OBPOSXT OF XiemOOROANISHS 
FOR THB PURPOflB Of PATBNT PROCBDDRBS 



TO 

Novertla AO 
Novartia corporation 
30 54 comvallla Rd. 
Research Triangle Park, 
HO 27709 



XHTBRNATXOSftX. FORM 

RXCBXPT XN THB CABR OF AN ORXBIHAI* DEPOSIT 
iaaued pursuant to Rula 7-X by tba 
IHTBRNATXOHAI* DEPOSITARY AUTHORXTT 
identified at the bottom of thie — 



HAMB AMD ADDRSSS 
OF DEPOSITOR 



t^tptcatip * or the hicrooroakism 



Identification reference given by the 
DEPOSITOR. 

jeeaerioiaa coll pNOVR400 | 



Accession number given by the 
XHTERHATXQHAL OIPOSITARY AUTHORITY I 

HRRX» B-30077 



,^f M DBB CT IPSSg AMD /OR PRgPggB TAYOHOMIC DBBIOHATION 



The microoraani.m identified inder X. above was accompanied by, 
Q a scientific deecription 



Thi» *»— - _ 

above, whieh was received 



proposed texonomio deeLgnation 
, w wifeh ■ groee wh ere applicable) 



p«ttTPT AW P XCCBPTAHCB 
:„«rn.tlon.l D^ityl«*« 



,thorlty acc.pc. u. ■luroorg.ni.m ^*J-*"?J?f 
on oetob-r 28, 1998 (d.t. of th. origin*! deport) 



T« MC8IP* C M«"P lr0n CONV»HHIO» 

p.po.itary ^thorfr" ^ . ^IfSd-r th. Bud.p.-t «.«ty «*■ «"^ d b * 

to oonv-rt th. origin.1 ****** ^ £{jr eonv|irBlon , . 



Nmoi Agriculture R»«Mrch eultur. 
collection <N8Sb) 
lnt.rn.tion.1 o.po.lt.ry Authority 

iddr.aa. 1818 H. Unlveraity Btro-t 

Pooria. IlH-aoia 6160* tJ.&.k 



Signature*) of poraon ( a > h * v ^» *^ i L w 
to r.pr...nt th. International 
Authority or of au^ri«ado«lolal(a» . 

' /X 3 "it 

Datei 



Paorl-a- ^ lltnaj-B — 

^ Rui. «.4|d» APPll-a. «** daf i- th. data on which th. at.tua of International 
dspoaitary authority **• acquired. 
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BUDAPEST TREATY OK TBS INTERNATIONAL 
RECOGNITION OF TBB DEPOSIT 07 KXOROOROANISMS 
fOR THE PURPOSE OF PATENT PROCEDURES 



INTERNATIONAL FORM 



TO 

Movartls AO 
Novartls Corporation 
3084 Cornwallis Rd. 
Research Triangle Park, 
HC 27709 


RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 
issued pursuant to Rule 7.1 by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified at the bottom of this page 


HAMS AND ADDRESS 
OF DEPOSITOR 




I . IDENTIFICATION OF THE MICROORGANISM 


Identification rafaranca given by the 
DEPOSITORi 


Accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY 1 


SmeharleJUA aoXX pNOVlOOl 


NRRL 8-30076 


IX. SCIENTIFIC DESCRIPTION AND /OR PROPOSED TAXOHOMXC OSS 10 NAT I ON 


Thai microorganism identified under I. abo* 


re wae accompanied byi 


n a soiantlf ic description 




ETI * propoeed tajeawomlc designation 




(Mark with a eroee vhare applicable) 




XXX. RECEIPT AND ACCEPTANCE 


This international Depositary Authority accepts the microorganism identified under X. 
above, which was reoeived by it on October 28, 1996(date of the original deposit)* 


XV. RECEIPT OF REQUEST FOR CONVERSION 


The microorganism identified under X. above wae received- by this international 
Depositary Authority on (data at the original deposit) and a request 
to convert the original deposit to a deposit under the Budapest Treaty wae received by 
it on (date of receipt of request for conversion). 


V. INTERNATIONAL DEPOSITARY AUTHORITY 


Hanoi Agricultural Research Culture 
Collect ion <NRAL) 
International Depositary Authority 


Signature <s ) of person (s) having the power 
to represent the International Depositary 
Authority or of author lsed official ( s)i 

Data. >*■ 


Addreest 1815 N. Unlvereity Street 

Peoria, Xlllnole 61604 U.S.A. 



1 Where Rule 6.4(d) applies, such data is the date on which the status of international 
depositary authority was acquired. 
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BUDAPEST TREATY OK TRX INTERNATIONAL RSOOGHZTZOH OF TBS DEPOSIT 
OF MICROOROAHXBXB FOR TBS PURPOSE OF PATENT PROCIDORES 



TO 

Novartia AO 

Novartia corporation 

3 OS 4 Comweilia Ad. 

Raaearch Triangle Park, 

jtC 27709 
HAMS A1I0 ADDRESS OF THE PARTY TO 
THE VIABILITY STATEMENT IS ISSUED 



INTERNATIONAL FORK 
VTnflTT.TTT 8 



laeued pursuant to Rule 10.2 by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified at the bottom at this page 



X. nePoaiTQR 



Kane i Novartia AO 

Novartia corporation 
Addr eaei 3054 cornvallle Rd. 

Research Triangle Park, 

NC 27709 



II. IDENTIFICATION OF THE MICROORGANISM 



Depositor* a t ax oncotic deeignation and 
aeceaaion number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY i 
Smehariohl* call NRRL B-3007B 

Date ott Ootober 28, 199B 

1 Original Deposit 
D 9 New Deposit 

□ 1 Repropagatlon at Original Deposit 



TTT. fal VIABILITY STATEMENT 



Deposit wsa found i viable Nonviable on October 31, 199 B (Date) 

International Depositary Authority's preparation waa found viable on pacaaoac a, isifiDat**' 



TTI. (a) DEPOSITOR'S EQUIVALENCY DECLARATION 



Depoeitor determined the International Depositary Authority 'a preparation was 

H 1 Equivalent D 1 Not equivalent to deposit on tzllSB. <°* feB > 

Signature of Depositor T^BfcV ^S/L^ 



IV- CONDITIONS UNDER WHICH THE VIABILITY TEST NAB PERFORMED {Deposit or a /De positary) 1 



7*Ae. drt«o Cult^f*- luo^& py+ into SWv£a ^JWvpGaj^^*/} ow\«i ^rewn *> 



«r?ec 



V. INTERNATIONAL DEPOSITARY AUTHORITY 



Hanoi Agricultural Reeeeroh Culture 
Collection (NRRL) 
International Depoeltary Authority 

Addreeei 1815 N. Unlverelty Street 
Tlltnnis 61604 U.S.A. 



Slgnsture(e) of pereon(0) having the power 
to repreoent the international Depoeltary 
Authority or of euthorlsad offloial(S>i 

/J 

PfttBl — — 



• Indict. tha die* ot thm original OapoJit or whan * nav dapoalt has baan aada. 

• nik with a erooa tfia applioaala bos. 

• tncha n tarred to in aula iO.IUHill and Ull». rafar to tha a»at reeant vtiblllty feaac 
t in i£ tha iAtocnatlon ha* baoa raquaacad. 
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BUDAPEST TREATY ON THE INTERNATIONAL 
RBCOQJmON OF THE DEPOSIT OF MICBOOBBMX 
FOR THB PURPOSE OP PATENT PROCEDURES 



TO 



Nova rr is Corp. 
c/o Novartie AG 
p. 0. Box 12257 
Research Triangle Park, 



NC 27709 



RECEIPT IN THE CASS OF AN ORIGINAL DEPOSIT 
issued pursuant to Rule 7 . i by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified at the bat t can of this page 



HAMS AND ADDRESS 
OF DEPOSITOR 



Identification reference given by the 
DEPOSITOR : 


Acceaeion number given by che 
INTERNATIONAL DEPOSITARY AUTHORITY : 


Bacteria up. pCIB 9359-7 


NRRL B- 21835 



IDENTIFICATION OF THE MICROORGANISM 



II. SCIENTIFIC DESCRIPTION AND /OR PROPOSED TA3CONOMTC DESIGNATION 



The microorganism identified under I. above van accompanied by: 
a scientific description 

f~] a proposed taxonomic daei gnat ion 

(Mark with a cross whare applicable) 



III. RECEIPT AND ACCEPTANCE 



This international Depositary Authority accepts the microorganism identified under I. 
above, which was received by it on September 17, 1997 (data of the original deposit) 1 



IV. RECEIPT OF REQUEST FOR CONVERSION 



The microorganism identified under I. above was received by this International 
Depositary Authority on (date of the original deposit) and a request 

to convert the original deposit to a deposit under the Budapest Treaty was received by 
it on (date of receipt of request for conversion) . 



V. INTERNATIONAL DEPOSITARY AUTHORITY 



Agricultural Research Culture 

Collection (NRRLJ 
International Depositary Authority 



Address: 1015 N . University Street 

Peoria, rilinois 61604 U.S.A. 



Signature (a) of person (a) having the power 
to represent the International Depositary 
Authority or of authorised official (s) : 

Date: ||- 0-*P 



Whare Rule 6.4(d) applies, such data ia Che date on which che ecacua of 
depositary authority was acquired. 
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BUDAPEST TREATY OK THE 

OF MICROORGANISMS TOR THE 



R2COC3NITI0H OF THE DEPOSIT 
OP PATENT PROCEDURES 



IOTEHKATXQHAL FORM 



TO 



Novartia Corp. 
c/o Novartifl AG 
p. 0. Bene 12257 

Research Triangle Park, NC 27709 

HAKB AND ADDRESS OF THE PARTY TO 
THE yiapTUTY 3TATKMBRT IS I380BP 



issued pursuant to Rule 10.2 by the 
XNXraSATXOHAL DEPOSITARY AUTHORITY 
i den tified at the bottom of thia page 



T DEPOSITOR 



II. IDSHTIFICATIOg OF THE XT 



Movartia Corp 
e/o Novartia AO 
Address: P. O. Box 12257 

Raaaareh Triangle Park, 
HC 27709 



iaitor»B taxonomic daaignation and 
aeeeseion number givan by the 
IHTERHATTOIIAL DEPOSITARY AUTHORITY: 
Baoteria ap. NRRL B* 21835 

Date of September 17, 1997 
IS * Original Deposit 
Q 1 New Deposit*. 

O 2 Reprepagatlon of Original Deposit 



Ttt fal VIABILITY STJU-gflffnTT 



Deposit was found: Viable Nonviable on September 18 r 1997 (Date) 

International Depositary Authority's preparation waa found viable on September 25 , 
1997 (Date) 3 



TTI. (b) DEPOSITOR'S EQOTVALSHCY DECLARATION 



Depoeicor determined the International Depoeitary Authority a preparation 

Q : Equivalent D * Not equivalent to deposit on — — — — 

Signature of Depositor ________ 



(Date) 



tv CONDITIONS TOPER WHICH THE VIABILITY TEST HAS PEW™™™ ' Depoai tors /Depositary ) % 



INTERNATIONAL DEPOSITARY AUTHORITY 



Name: Agricultural Raaaareh Culture 
Collection (NRRL) 
International Depositary Authority 

a ddr ess: 1815 N. University Street 
p«~ia. Illinoia 61604 U.S.A. 



Signature <e) of person (a) having the power 
to represent tfto International Depositary 
Authority or of authorised officiaKe): 



» tndlcata tha data of tha original dapoalt ox whan a now deposit r,ua boon mn-Je. 
i M»rfc with * croafl tho applinaoJw 

• Intna oaaaa fatacrad to in (tula 10.21*1(1 II ano (HI). mf«f ► * «»oiit Mwne vj.auiJ.Xty taac. 

• rill in If tha Information haa boon raquoatod. 
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What is claimed is: 

1 . An isolated nucleic acid molecule comprising: 

(a) a nucleotide sequence substantially similar to a nucleotide sequence 
selected from the group consisting of: nucleotides 412-1665 of SEQ ID NO:1 , 
nucleotides 1686-2447 of SEQ ID NO:1 t nucleotides 2758-3318 of SEQ ID 
NO:1, nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of 
SEQ ID NO:1, nucleotides 15,171-18,035 of SEQ ID NO:11, and nucleotides 
31 ,393-35,838 of SEQ ID NO:1 1 ; 

(b) a nucleotide sequence comprising nucleotides 23,768-31,336 of SEQ ID 
NO:11;or 

(c) a nucleotide sequence isocoding with the nucleotide sequence of (a) or (b); 
wherein expression of said nucleic acid molecule results in at least one toxin that is active 
against insects. 

2. An isolated nucleic acid molecule comprising a 20 base pair nucleotide portion 
identical in sequence to a consecutive 20 base pair nucleotide portion of a nucleotide 
sequence selected from the group consisting of: nucleotides 412-1665 of SEQ ID NO:1, 
nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, 
nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of SEQ ID NO:1, 
nucleotides 15,171-18,035 of SEQ ID NO:11, and nucleotides 31,393-35,838 of SEQ ID 
NO:11, wherein expression of said nucleic acid molecule results in at least one toxin that is 
active against insects. 

3. An isolated nucleic acid molecule comprising a nucleotide sequence from 
Photorhabdus luminescens selected from the group consisting of: nucleotides 412-1665 of 
SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID 
NO:1 f nucleotides 3342-4118 of SEQ ID NO:1, nucleotides 4515-9269 of SEQ ID NO:1 f 
nucleotides 66-1898 of SEQ ID NO:11, nucleotides 2416-9909 of SEQ ID NO:11, the 
complement of nucleotides 2817-3395 of SEQ ID NO:11, nucleotides 9966-14,633 of SEQ 
ID NO:11, nucleotides 14,699-15,007 of SEQ ID NO:11, nucleotides 15,171-18,035 of SEQ 
ID NO:1 1 , the complement of nucleotides 1 7,072-1 7,398 of SEQ ID NO:1 1 , the complement 
of nucleotides 18,235-19,167 of SEQ ID N0:11, the complement of nucleotides 19,385- 
20,116 of SEQ ID NO:1 1, the complement of nucleotides 20,217-20,963 of SEQ ID NO:11, 



WO 99/42589 



- 73 - 



PCT/EP99/01015 



the complement of nucleotides 22,172-23,086 of SEQ ID NO:1 1 , nucleotides 23,768-31,336 
of SEQ ID NO:11, nucleotides 31,393-35,838 of SEQ ID NO:11, the complement of 
nucleotides 35,383-35,709 of SEQ ID NO:11, the complement of nucleotides 36,032-36,661 
of SEQ ID NO:1 1 , and the complement of nucleotides 36,654-37,781 of SEQ ID NO:1 1 . 

4. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence is substantially similar to nucleotides 412-1665 of SEQ ID NO:1, nucleotides 
1686-2447 of SEQ ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, nucleotides 3342- 
41 18 of SEQ ID NO:1, or nucleotides 4515-9269 of SEQ ID NO:1 . 

5. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes an amino acid sequence selected from the group consisting of SEQ ID 
NOs:2-6. 

6. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence comprises nucleotides 412-1665 of SEQ ID NO: 1, nucleotides 1686-2447 of SEQ 
ID NO:1, nucleotides 2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1, 
or nucleotides 4515-9269 of SEQ ID NO:1. 

7. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence is substantially similar to nucleotides 15,171-18,035 or 31,393-35,838 of SEQ ID 
NO:11. 

8. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes the amino acid sequence set forth in SEQ ID NOs:12-14. 

9. An isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence comprises nucleotides 15,171-18,035; 23,768-31,336; or 31,393-35,838 of SEQ 
ID NO:11. 

10. An isolated nucleic acid molecule according to claim 2, comprising a 20 base pair 
nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide portion of 
nucleotides 412-1665 of SEQ ID NO:1, nucleotides 1686-2447 of SEQ ID NO:1, nucleotides 
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2758-3318 of SEQ ID NO:1, nucleotides 3342-4118 of SEQ ID NO:1, or nucleotides 4515- 
9269 of SEQ ID NO:1. 

11. An isolated nucleic acid molecule according to claim 2, comprising a 20 base pair 
nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide portion of 
nucleotides 15,171-18,035 or 31 t 393-35 T 838 of SEQ ID NO:11. 

12. A chimeric gene comprising a heterologous promoter sequence operatively linked to 
the nucleic acid molecule of claim 1 or claim 2. 

13. A recombinant vector comprising the chimeric gene of claim 12. 

14. A host cell comprising the chimeric gene of claim 12. 

15. A host cell according to claim 14, which is a bacterial cell. 

16. A-host cell according to claim 14, which is a yeast cell. 

17. A host cell according to claim 14, which is a plant cell. 

18. A plant comprising the plant cell of claim 17. 

19. A plant according to claim 18, which is maize. 

20. A toxin produced by the expression of a DNA molecule according to claim 1 or claim 
2. 

21 . A toxin according to claim 20, wherein said toxin has activity against Lepidopteran 
insects. 

22. A toxin according to claim 21 , wherein said toxin has activity against Plutella xylostella 
(Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia nubilalis (European Com 
Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea (Com Earworm), 
Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall Armyworm). 
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23. A toxin according to claim 20, wherein said toxin has activity against Lepidopteran and 
Coleopteran insects. 

24. A toxin according to claim 23, wherein said toxin has insecticidal activity against 
Plutella xylostella (Diamondback Moth), Ostrinia nubilalis (European Corn Borer), and 
Manduca sexta (Tobacco Homworm), Diabrotica virgifera virgifera (Western Com 
Rootworm), Diabrotica undecimpunctata howardi (Southern Com Rootworm), and 
Leptinotarsa decimlineata (Colorado Potato Beetle). 

25. A toxin according to claim 20, wherein said toxin comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:2-6. 

26. A toxin according to claim 20, wherein said toxin comprises an amino acid sequence 
selected from the group consisting of: SEQ ID NOs:12-14. 

27. A composition comprising an insecticidally effective amount of a toxin according to 
claim 20. 

28. A method of producing a toxin that is active against insects, comprising: 

(a) obtaining the host cell of claim 14; and 

(b) expressing the nucleic acid molecule in said cell, which results in at least one 
toxin that is active against insects. 

29. A method of producing an insect-resistant plant, comprising introducing a nucleic acid 
molecule according to claim 1 into said plant, wherein said nucleic acid molecule is 
expressible in said plant in an effective amount to control insects. 

30. A method of controlling insects comprising delivering to the insects an effective 
amount of a toxin according to claim 44. 



31 . The method of claim 29 or claim 30, wherein the insects are Lepidopteran insects. 
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32. The method of claim 31 , wherein the insects are selected from the group consisting 
of: Plutella xylostella (Diamondback Moth), Trichoplusia ni (Cabbage Looper), Ostrinia 
nubilalis (European Com Borer), Heliothis virescens (Tobacco Budworm), Helicoverpa zea 
(Corn Earworm), Spodoptera exigua (Beet Armyworm), and Spodoptera frugiperda (Fall 
Armyworm). 

33. The method of claim 29 or claim 30, wherein the insects are Lepidopteran and 
Coleopteran insects. 

34. The method of claim 33, wherein the insects are selected from the group consisting 
of: Plutella xylostella (Diamondback Moth), Ostrinia nubilalis (European Com Borer), and 
Manduca sexta (Tobacco Homworm), Diabrotica virgifera virgifera (Western Corn 
Rootworm), Diabrotica undecimpunctata howardi (Southern Corn Rootworm), and 
Leptinotarsa decimlineata (Colorado Potato Beetle). 

35. The method of claim 30, wherein the toxin is delivered to the insects orally. 

36. A method for mutagenizing a nucleic acid molecule according to claim 1 , wherein the 
nucleic acid molecule has been cleaved into population of double-stranded random 
fragments of a desired size, comprising: 

(a) adding to the population of double-stranded random fragments one or more 
single- or double-stranded oligonucleotides, wherein said oligonucleotides 
each comprise an area of identity and an area of heterology to a double- 
stranded template polynucleotide; 

(b) denaturing the resultant mixture of double-stranded random fragments and 
oligonucleotides into single-stranded fragments; 

(c) incubating the resultant population of single-stranded fragments with a 
polymerase under conditions which result in the annealing of said single- 
stranded fragments at said areas of identity to form pairs of annealed 
fragments, said areas of identity being sufficient for one member of a pair to 
prime replication of the other, thereby forming a mutagenized double- 
stranded polynucleotide; and 
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(d) repeating the second and third steps for at least two further cycles, wherein 
the resultant mixture in the second step of a further cycle includes the 
mutagenized double-stranded polynucleotide from the third step of the 
previous cycle, and wherein the further cycle forms a further mutagenized 
double-stranded polynucleotide. 
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SEQUENCE LISITN3 
<110> Novartis AG 

<120> Novel Toxins And Uses Thereof 

<130> PI/5-30421/A/OGC 1963 

<140> 
<141> 

<160> 22 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 9717 
<212> ENA 

<213> Photorhabdus lurainescens 

<220> 
<221> CDS 

<222> (412).. (1665) 
<223> orfl -46.4 fcDa 

<220> 
<221> CDS 

<222> (1686) (2447) 
<223> orf2 ~28.1kDa 

<220> 
<221> CDS 

<222> (2758) .. (3318) 
<223> orf3 -20.7 kDa 

<220> 
<221> CDS 

<222> (3342).. (4118) 
<223> orf4 -28.7 KDa 

<220> 
<221> CDS 

<222> (4515) .. (9269) 
<223> orf5 -176 kDa 

<400> 1 

gaattcatat gctatgaaat aaacagttgg cgcaataatt aaagctatta tttttatttt 60 

gtttttatac aatgatatgc tttattaaac agaataatga gttaatgata aataaatcct 120 

cgggatttat catgatatta tggccgaatg tgatgtgaac aattatttta taattagat't 180 

aataatataa tggtattaaa ataacaatat atttattcat gggtatttat catcggtttt 240 

attacatggg gaataatcta taaattagtt ttacataatt eacaaatagc gattccatta 300 

accaggaata ttaaaaatac ttatttatga ttatggtgat atatcttcat tagcctactt 360 

ttataactag aaaaattgac attttcaatc catgtataaa tggtaaccaa t atg cag 417 

Met Gin 
1 

aga get caa cga gtt gtt att aca ggt atg ggt gec gta aca ccg att 465 
Arg Ala Gin Arg Val Val He Thr Gly Met Gly Ala Val Thr Pro He 



5 



10 



15 
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ggt gaa gat gtt gaa tea tgt tgg caa agt att att gaa aaa caa cat 
Gly Glu Asp Val Glu Ser Cys Trp Gin Ser He He Glu Lys Gin His 
20 25 30 



gaa tat tac gat aaa tac gaa tgt ggc gta att ctt ggc agt ggt tgg 
Glu Tyr Tyr Asp Lys Tyr Glu Cys Gly Val He Leu Gly Ser Gly Trp 
100 105 110 



513 



cga ttt cac aga att gaa ttt cct gac tea ttc att aat teg cgt ttc 561 
Arg Phe His Arg He Glu Phe Pro Asp Ser Phe He Asn Ser Arg Phe 
35 40 45 50 

ttt tct ttc ctt gca cca aac cca tec cgc tat cag tta tta oca aaa 609 
Phe Ser Phe Leu Ala Pro Asn Pro Ser Arg Tyr Gin Leu Leu Pro Lys 
55 60 65 

aag ttg act cat aca ctt tct gac tgc gga aaa gca gca ttg aag gcg 657 
Lys Leu Thr His Thr Leu Ser Asp Cys Gly Lys Ala Ala Leu Lys Ala 
70 75 80 

act tat caa get ttt acc caa gca ttc ggc gtg aat ata tea cct gtt 705 
inr Tyr Gin Ala Hie Thr Gin Ala Phe Gly Val Asn He Ser Pro Val 
85 90 95 



753 



gga get att gat aat gee gga gat cat get tgc caa tat aag caa gca 801 
Gly Ala He Asp Asn Ala Gly Asp His Ala Cys Gin Tyr Lys Gin Ala 
115 120 125 130 

aaa tta get cat cct atg agt aat ctt att acc atg cca age tec atg 849 
Lys Leu Ala His Pro Met Ser Asn Leu He Tnr Met Pro Ser Ser Met 
135 140 145 

acg get gca tgt teg att atg tat gga eta cgt ggt tat caa aat acc 897 
Tnr Ala Ala Cys Ser He Met Tyr Gly Leu Arg Gly Tyr Gin Asn Thr 
150 155 160 

gtt atg get gee tgc gca acg ggc aca atg gcg ata ggc gat gee ttt 945 
Val Met Ala Ala Cys Ala Thr Gly Thr Met Ala He Gly Asp Ala Phe 
165 * 170 175 

gaa att att cgc tea ggg egg gca aaa tgt atg att gee gga gee get 993 
Glu He He Arg Ser Gly Arg Ala Lys Cys Met He Ala Gly Ala Ala 
180 185 190 

gaa tea etc acg egg gaa tgt aat att tgg agt att gat gta ctg aat 1041 
Glu Ser Leu Thr Arg Glu Cys Asn He Trp Ser He Asp Val Leu Asn 
195 200 " 205 210 

gca tta teg gaa caa gcg gac cca aat ctt gca tgt tgt cca ttt 1089 
Ala Leu Ser Lys Glu Gin Ala Asp Pro Asn Leu Ala Cys Cys Pro Phe 
215 220 225 

age ctt gat cgc tct gga ttt gta tta gee gaa gga gcg gcg gta gtt 1137 
Ser Leu Asp Arg Ser Gly Phe Val Leu Ala Glu Gly Ala Ala Val Val 
230 235 240 

tgt ctg gaa aat tat gat tea gee ate gcg cgt ggt gca acg att tta 1185 
Cys Leu Glu Asn Tyr Asp Ser Ala He Ala Arg Gly Ala Thr He Leu 
245 250 255 

gcg gaa att aaa ggt tac gee caa tat tea gat gee gtt aat tta acc 1233 
Ala Glu He Lys Gly Tyr Ala Gin Tyr Ser Asp Ala Val Asn Leu Thr 
260 265 270 
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cgg cca aca gaa gat att gaa cct aaa ata tta gcg ata act aaa gcc 1281 
Arg Pro Thr Glu Asp lie Glu Pro Lys lie Leu Ala lie Thr Lys Ala 
275 280 285 290 

att gag cag gca cag att teg ccg aaa gat att gac tac att aat get 1329 
lie Glu Gin Ala Gin lie Ser Pro Lys Asp lie Asp Tyr lie Asn Ala 
295 300 - 305 

cat ggt act tct aca ccg tta aat gat ctt tat gaa act cag gca att 1377 
His Gly Thr Ser Thr Pro Leu Asn Asp Leu Tyr Glu Thr Gin Ala lie 
310 315 320 

aaa gca gca ctg ggc caa tat get tat cag gta cct ata tea age aca 1425 
Lys Ala Ala Leu Gly Gin Tyr Ala Tyr Gin Val Pro He Ser Ser Thr 
325 330 335 

aaa tct tat ace ggc cac ctt att get gcc gcc ggt agt ttt gaa acg 1473 
Lys Ser Tyr Thr Gly His Leu lie Ala Ala Ala Gly Ser Phe Glu Thr 
340 345 350 

att gta tgt gtg aaa gca tta get gaa aat tgc ttg cca gca aca ttg 1521 
lie Val Cys Val Lys Ala Leu Ala Glu Asn Cys Leu Pro Ala Thr Leu 
355 360 365 370 

aat tta cac egg gcc gat cca gat tgc gat etc aat tat ttg cct aat 1569 
Asn Leu His Arg Ala Asp Pro Asp Cys Asp Leu Asn Tyr Leu Pro Asn 
375 380 385 

caa cat tgc tac ace get caa cca gag gtg aca etc aat att age gca 1617 
Gin His Cys Tyr Thr Ala Gin Pro Glu Val Thr Leu Asn He Ser Ala 
390 395 400 

ggt ttc ggc ggg cat aac get gcg ttg gtt ate get aag gta agg taa 1665 
Gly Phe Gly Gly His Asn Ala Ala Leu Val lie Ala Lys Val Arg 
405 410 415 

ctgatatgtt gatttttgea atg gaa gat att gaa cat tgg teg aat ttc tct 1718 
Met Glu Asp He Glu His Trp Ser Asn Phe Ser 
420 425 

ggg gat ttt aac ccc ate cat tat teg gcg aaa age gag tct ttg cgc 1766 
Gly Asp Phe Asn Pro He His Tyr Ser Ala Lys Ser Glu Ser Leu Arg 
430 435 440 445 

aat ata cag- caa cac ccg gtg cag gga atg ttg agt ttg etc tat gta 1814 
Asn He Gin Gin His Pro Val Gin Gly Met Leu Ser Leu Leu Tyr Val 
450 455 460 

egg caa cag ttt tct caa tta act tec get ttt aca acg gga ata ttg 1862 
Arg Gin Gin Phe Ser Gin Leu Thr Ser Ala Phe Thr Thr Gly He Leu 
465 470 475 

aac att gat gcc tct ttc cgc cag tat gtt tat ace gca tta ccc cat 1910 
Asn He Asp Ala Ser Phe Arg Gin Tyr Val Tyr Thr Ala Leu Pro His 
480 485 490 

caa ctg agg att aat act aaa aac aaa acg ttt aaa tta gaa aat ccc 1958 
Gin Leu Arg He Asn Thr Lys Asn Lys Thr Phe Lys Leu Glu Asn Pro 
495 500 505 

agt aaa gaa aac acg ttg ttc ggc aat ace age gta gag aat aca atg 2006 
Ser Lys Glu Asn Thr Leu Phe Gly Asn Thr Ser Val Glu Asn Thr Met 
510 515 520 525 

gag tea att gaa gat tgg ate gtt cag gat aat tgt caa aaa eta acg 2054 
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Glu Ser He Glu Asp Trp He Val Gin Asp Asn Cys Gin Lys Leu Thr 
530 535 540 

ata aca ggg gag gaa gtt tgt gaa aag tat get gtc ttt aga tac tat 2102 
He Thr Gly Glu Glu Val Cys Glu Lys Tyr Ala Val Phe Arg Tyr Tyr 
545 550 555 

ttc cca agt gtc act tct att gga tgg ttc ctg gat gcg ctt get ttt 2150 
Phe Pro Ser Val Inr Ser He Gly Trp Phe Leu Asp Ala Leu Ala Phe 
560 565 570 

cat ctt att att aat teg aca gga ttt ctt aat ttt gag cac tac cat 2198 
His Leu He He Asn Ser Inr Gly Phe Leu Asn Phe Glu His Tyr His 
575 580 585 

ttt aac caa tta cag gat tat ctg agt caa tct ttt act ttg cat act 2246 
Phe Asn Gin Leu Gin Asp Tyr Leu Ser Gin Ser Phe Ihr Leu His Inr 
590 595 600 605 

ggg caa gcg att aaa ate agg aag gag att gtt aat agt aca gta tta 2294 
Gly Gin Ala He Lys He Arg Lys Glu He Val Asn Ser Thr Val Leu 
610 615 620 

tta tct tea ccg gat ate tgt gtt gaa tta aat cct cct tta ttg att 2342 
Leu Ser Ser Pro Asp \ lie Cys Val Glu Leu Asn Pro Pro Leu Leu He 
625 630 635 

aag aat ggc gat aaa gat tat att cgt att ttc tat tat cga tgt tta 2390 
Lys Asn Gly Asp Lys Asp Tyr He Arg He Phe Tyr Tyr Arg Cys Leu 
640 645 650 

tat gat aaa aaa cct att ttt gta tea aag act tea att ate tct aag 2438 
Tyr Asp Lys Lys Pro He Phe Val Ser Lys Thr Ser He He Ser Lys 
655 660 665 

atg aaa taa aaggaaagcg aaatgecaac acaaagtgat attttcactg 2487 

Met Lys 

670 

aaataaagaa tagaatatta atgatgaagg atatagaaga tgaagaaata acaccagagt 2547 

cctcttttgt ttcgcttgaa tttgatagtc ttgactatgt ggaaatccaa gtttttgtgt 2607 

tggaagcgta tggtattgtg ettaaagecg aacttttttc aaatcattct atttcaacat 2667 

taaatgagct cactgactat ttaaaatcaa aattgtaatc tgaattttta cttaattatg 2727 

ttttttcacc attaacatta agaggttata atg aac gtt tta gaa caa ggt aag 2781 

Met Asn Val Leu Glu Gin Gly Lys 
675 680 

gtt get get tta tat tea gee tat teg gaa aca gaa ggt tct teg tgg 2829 
Val Ala Ala Leu Tyr Ser Ala Tyr Ser Glu Thr Glu Gly Ser Ser Trp 
685 690 695 

gtg gga aac ttg tgc tgt ttt tea agt gat egg gag cat ttg cct att 2877 
Val Gly Asn Leu Cys Cys Phe Ser Ser Asp Arg Glu His Leu Pro He 
700 705 710 

ate gtg aat ggg cgt cgt ttc ttg att gaa ttt gtt att cca gat cat 2925 
He Val Asn Gly Arg Arg Phe Leu He Glu Phe Val He Pro Asp His 
715 720 725 

tta ctt gat aaa acg gtt aaa ccc aga gta ttc gat ttg gat ate aat 2973 
Leu Leu Asp Lys Thr Val Lys Pro Arg Val Phe Asp Leu Asp He Asn 
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730 735 740 

aaa caa ttt tta ctg cgt cgt gac cat cgt gag ata aat att tat ctt 3021 
Lys Gin Phe Leu Leu Arg Arg Asp His Arg Glu He Asn He Tyr Leu . 
745 750 755 760 

tta ggt gaa gga aat ttt atg gat agg acg acg aca gat aaa aat eta 3069 
Leu Gly Glu Gly Asn Phe Met Asp Arg Thr Thr Thr Asp Lys Asn Leu 
765 770 775 

ttc gag tta aat gag gat ggt tea eta ttt att aag acg tta cgc cat 3117 
Phe Glu Leu Asn Glu Asp Gly Ser Leu Phe He Lys Thr Leu Arg His 
780 785 ' 790 

get ctt ggt aaa tat gtt get att aat cct tea act acg caa ttt ate 3165 
Ala Leu Gly Lys Tyr Val Ala He Asn Pro Ser Thr Thr Gin Phe He 
795 800 805 

ttc ttt gca caa gga aag tac agt gaa ttt ate atg aat gee tta aag 3213 
Phe Phe Ala Gin Gly Lys Tyr Ser Glu Phe He Met Asn Ala Leu Lys 
810 815 820 

aca gtt gaa gac gaa tta tea aaa cgt tat cga gtc aga att att cct 3261 
Thr Val Glu Asp Glu Leu Ser Lys Arg Tyr Arg Val Arg He He Pro 
825 830 835 ~ 840 

gaa ttg caa ggg ccg tat tat ggc ttt gaa ctt gat att ctt tct att . 3309 
Glu Leu Gin Gly Pro Tyr Tyr Gly Phe Glu Leu Asp He Leu Ser He 
845 850 * 855 

aca get taa ttca c aatat tatggagagt gtt atg gaa aag aaa ata aca aca 3362 
Thr Ala Met Glu Lys Lys He Thr Thr 

860 865 

ttt ace att gag aaa act gat gac aat ttt tat get aat ggg cgt cat 3410 
Phe Thr He Glu Lys Thr Asp Asp Asn Hie Tyr Ala Asn Gly Arg His 
870 875 880 

caa tgt atg gta aaa ate tct gta ctt aaa caa gaa tat agg aat ggt 3458 
Gin Cys Met Val Lys He Ser Val Leu Lys Gin Glu Tyr Arg Asn Gly 
885 890 895 

gat tgg ata aaa tta gca ctt agt gag get gaa aaa aga teg att cag 3506 
Asp Trp He Lys Leu Ala Leu Ser Glu Ala Glu Lys Arg Ser He Gin 
900 905 910 

gtg gcg gca tta agt gat age etc ata tat gac caa tta aaa atg cct 3554 
Val Ala Ala Leu Ser Asp Ser Leu He Tyr Asp Gin Leu Lys Met Pro 
915 920 925 ~ 930 

tea ggt tgg aca acg aca gat gca aga aat aaa ttt gat ctt ggg tta 3602 
Ser Gly Trp Thr Thr Thr Asp Ala Arg Asn Lys Phe Asp Leu Gly Leu 
935 940 945 

tta aat ggt gtt tat cat get gat get ttt att gac gaa cag gta aca 3650 
Leu Asn Gly Val Tyr His Ala Asp Ala Phe He Asp Glu Gin Val Thr 
950 955 960 

gat cgt gcg gga gat tgc tgc aca aat gaa aac tat cag aac agt gtg 3698 
Asp Arg Ala Gly Asp Cys Cys Thr Asn Glu Asn Tyr Gin Asn Ser Val 
965 970 975 

aaa agt gtt cct gaa att ate tat cgt tat gtc agt agt aat aga aca 3746 
Lys Ser Val Pro Glu He He Tyr Arg Tyr Val Ser Ser Asn Arg Thr 
980 985 990 
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age aca gaa tac eta atg gca aaa atg aca ttt gaa gat acg gat ggg 3794 
Ser Thr Glu Tyr Leu Met Ala Lys Met Thr Phe Glu Asp Thr Asp Gly 
995 " 1000 1005 1010 

aaa cgc aca tta aca acg aat atg tea gtt ggt gat gaa gtt ttt gac 3842 
Lvs Arg Thr Leu Thr Thr Asn Met Ser Val Gly Asp Glu Val Phe Asp 
1015 1020 1025 

age aag gtt tta tta aaa gec att get cct tat gca att aat aca aat 3890 
Ser Lys Val Leu Leu Lys Ala lie Ala Pro Tyr Ala He Asn Thr Asn 
1030 1035 1040 

caa ttg cat gaa aac ate aat aca ttg ttt gat aaa aca gaa gag ccg 3938 
Gin Leu His Glu Asn He Asn Thr Leu Phe Asp Lys Thr Glu Glu Pro 
1045 1050 1055 

aca aaa tec gat act cat cat caa ata att aat ctt tat cgc tgg aca 3986 
Thr Lys Ser Asp Thr His His Gin He He Asn Leu Tyr Arg Trp Thr 
1060 1065 1070 

ttg cca tat cat ttg agg att ctt gaa ggg aat gac agt act gtt aat 4034 
Leu Pro iyr His Leu Arg He Leu Glu Gly Asn Asp Ser Thr Val Asn 
1075 1 1080 1085 1090 

aga ata tat gtc ctt ggt aaa gag cca tea aat gat aga ttc ctg aca * 4082 
Arg He Tyr Val Leu Gly Lys Glu Pro Ser Asn Asp Arg Phe Leu Thr 
1095 HOO 1105 

aga gga agg gta ttt aaa cga gga act cat atg tga atgcacgtga 4128 
Arg Gly Arg Val Phe Lys Arg Gly Thr His Met 
1110 1U5 

taatgtgagt ggaggatgtg ttatggacta tgettatace gtaactattc cggacacgca 4188 

gettgetget gaagtgcttc atgtgacagg gtgttcgtgg acgagtggtt attatgatgg 4248 

atatcatgat gtcacaatca ttgataacta cggttgtcag cataaattta gaatttcttc 4308 

ggttaatatt ggacgtgcgc taagcatagc gagaataagt tgattttcct tagtaaaaaa 4368 

cctttgttta tgctggtaaa cgcatgtgcg tttgecagea attaatatat tccattattg 4428 

aaataggaat atagecatat ctgtaattat acataaacga atttttactc gaatataatt 4488 

ttaattgatc aaacaggaaa tttaaa atg aaa get acc gat ata tat tec aat 4541 

Met Lys Ala Thr Asp He Tyr Ser Asn 
1120 1125 

get ttt aat ttc ggt tct tat att aat act ggt gtc gat ccc aga aca 4589 
Ala Phe Asn Ehe Gly Ser Tyr He Asn Thr Gly Val Asp Pro Arg Thr 
1130 1135 1140 

ggt caa tat agt gca aat att aat att ate acg tta aga cct aat aat 4637 
Gly Gin Tyr Ser Ala Asn He Asn He He Thr Leu Arg Pro Asn Asn 
1145 1150 1155 

gtg ggt aat teg gaa caa aca ttg age eta tea ttc teg cca tta aca 4685 
Val Gly Asn Ser Glu Gin Thr Leu Ser Leu Ser Phe Ser Pro Leu Thr 
1160 1165 1170 1175 

acg tta aac aat ggc ttt ggt att ggc tgg cgc ttt tea tta aca aca 4733 
Thr Leu Asn Asn Gly Phe Gly He Gly Trp Arg Phe Ser Leu Thr Thr 
1180 H85 1190 
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tta gat ata aaa aca ctt aca ttt age cga gca aat ggg gag caa ttt 
Lai Asp He Lys Thr Leu Thr Phe Ser Arg Ala Asn Gly Glu Gin Phe 
1195 1200 1205 

aaa tgt aag cca ttg ccg cct aat aat aat gat ctt agt ttt aaa gat 
Lys Cys Lys Pro Leu Pro Pro Asn Asn Asn Asp Leu Ser Phe Lys Asp 
1210 1215 1220 

aaa aaa eta aaa gat ttg cgc gta tat aag etc gat age aat act ttt 
Lys Lys Leu Lys Asp Leu Arg Val Tyr Lys Leu Asp Ser Asn Thr Phe 
1225 1230 1235 

tat gtt tat aac aaa aac ggc att ata gag ata ctt aaa cga att ggg 
Tyr Val Tyr Asn Lys Asn Gly He He Glu He Leu Lys Arg He Gly 
1240 1245 1250 1255 

teg agt gat att gca aaa aca gtt gca ctt gaa ttt cct gat ggt gaa 
Ser Ser Asp He Ala Lys Ihr Val Ala Leu Glu Phe Pro Asp Gly Glu 
1260 1265 1270 

gca ttt gat tta att tat aat tea aga ttt gca ttg tec gaa ata aaa 
Ala Phe Asp Leu He Tyr Asn Ser Arg Phe Ala Leu Ser Glu He Lys 
1275 1280 1285 

tac cgt gtg aca ggt aaa act tat ctt aaa etc aat tac tct gga aat 
Tyr Arg Val Thr Gly Lys Thr Tyr Leu Lys Leu Asn Tyr Ser Gly Asn 
1290 1295 1300 

aac tgt aca tea gtg gaa tac cct gat gat aat aat att tct gcg aaa 
Asn Cys Ihr Ser Val Glu Tyr Pro Asp Asp Asn Asn lie Ser Ala Lys 
1305 1310 1315 

ata gca ttc gat tat cgt aac gat tac ctt att acg gtg act gta cct 
lie Ala Phe Asp Tyr Arg Asn Asp Tyr Leu He Thr Val Thr Val Pro 
1320 1325 1330 1335 

tac gat get tct ggt cct att gat tct gee cga ttt aag atg acc tat 
iyr Asp Ala Ser Gly Pro He Asp Ser Ala Arg Phe Lys Met Thr Tyr 
1340 1345 1350 

cag aca tta aaa ggc gta ttt cca gtt ate age acc ttc cgt aca cca 
Gin Thr Leu Lys Gly Val Phe Pro Val He Ser Thr Phe Arg Thr Pro 
1355 1360 1365 

acc ggt tat gtt gag ctg gtg agt tat aaa gag aat ggg cat aaa gtg 
Thr Gly Tyr Val Glu Leu Val Ser Tyr Lys Glu Asn Gly His Lys Val 
1370 1375 1380 

acg gac acg gaa tat att cct tat gcg get gca etc act att caa ccc 
Thr Asp Thr Glu Tyr He Pro Tyr Ala Ala Ala Leu Thr He Gin Pro 
1385 1390 1395 

ggc aat gga caa cct gcg gtc age aaa tec tat gaa tat agt tea gta 
Gly Asn Gly Gin Pro Ala Val Ser Lys Ser Tyr Glu Tyr Ser Ser Val 
1400 1405 1410 1415 

cat aac ttc ttg ggc tat tct tct ggc egg aca age ttt gat tec agt 
His Asn Phe Leu Gly Tyr Ser Ser Gly Arg Thr Ser Phe Asp Ser Ser 
1420 1425 1430 

caa gat aat ttg tat ttg gtc aca ggg aaa tac act tat tea tec att 
Gin Asp Asn Leu Tyr Leu Val .Thr Gly Lys Tyr Thr Tyr Ser Ser He 
1435 1440 1445 

gaa egg gtt tta gat ggt caa agt gtg gtt tea gta ata gaa cga gta 
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Giu Arg Val Leu Asp Gly Gin Ser Val Val Ser Val lie Glu Arg Val 
1450 1455 1460 

ttt aat aaa ttc cat tta atg acc aaa gaa gca aaa aca caa gat aat 5597 
Phe Asn Lys Phe His Leu Met Thr Lys Glu Ala Lys Thr Gin Asp Asn 
1465 1470 1475 

aag aga att aca aca gaa att act tac aat gag gat eta tea aaa agt 5645 
Lys Arg lie Thr Thr Glu He Thr Tyr Asn Glu Asp Leu Ser Lys Ser 
1480 1485 1490 1495 

ttc tea gag caa cca gaa aat tta caa caa cct tct cgc gtg tta acc 5693 
Phe Ser Glu Gin Pro Glu Asn Leu Gin Gin Pro Ser Arg Val Leu Ihr 
1500 1505 1510 

cgt tat acg gat ata caa aca aat act tea cga gaa gag act gtc aat 5741 
Arg Tyr Thr Asp He Gin Ihr Asn Tnr Ser Arg Glu Glu Ihr Val Asn 
1515 1520 1525 

att aaa agt gat gat tgg gga aat act eta ctt att act gag acc agt 5789 
He Lys Ser Asp Asp Trp Gly Asn Tnr Leu Leu He Ihr Glu Ihr Ser 
1530 1535 1540 

ggg ata cag aaa gaa tac gtt tat tat ccg gtc aat ggc gaa ggt aat 5837 
Gly He Gin Lys Glu Tyr Val Tyr Tyr Pro Val Asn Gly Glu Gly Asn 
1545 1550 1555 

agt tgc cct gec gat ccc ttg ggt ttt tct egg ttc tta aaa tea gtt 5885 
Ser Cys Pro Ala Asp Fro Leu Gly Phe Ser Arg Phe Leu Lys Ser Val 
1560 1565 1570 1575 

acg caa aaa gga teg cct gat get get caa agt gtc gca aat aaa gtg 5933 
Thr Gin Lys Gly Ser Pro Asp Ala Ala Gin Ser Val Ala Asn Lys Val 
1580 1585 1590 

att cat tat aca tat caa aaa ttt cct act ttt acc ggc get tat gtt 5981 
He His Tyr Thr Tyr Gin Lys Phe Pro Thr Phe Thr Gly Ala Tyr Val 
1595 1600 1605 

aag gaa tat gtc agt aaa gtc tea gag acg ata gac aat aaa ata gcg 6029 
Lys Glu Tyr Val Ser Lys Val Ser Glu Thr He Asp Asn Lys He Ala 
1610 1615 1620 

aga acc ttt age tat gtt aac tea ccg acg agt aaa tct cat ggt teg 6077 
Arg Thr Phe Ser Tyr Val Asn Ser Pro Thr Ser Lys Ser His Gly Ser 
1625 1630 1635 

tta gca aaa ata acg tea gtg atg aat aac cag caa acg gtc acc aca 6125 
Leu Ala lys He Thr Ser Val Met Asn Asn Gin Gin Thr Val Thr Thr 
1640 * 1645 1650 1655 

ttt aaa tat gaa tat tea gaa agt gag atg acc aca aat get acg gtg 6173 
Phe Lys Tyr Glu Tyr Ser Glu Ser Glu Met Thr Thr Asn Ala Thr Val 
1660 1665 1670 

acc ggt ttt gat ggc gca cat atg gaa teg aaa aat gtg acg tct att 6221 
Thr Gly the Asp Gly Ala His Met Glu Ser Lys Asn Val Thr Ser He 
1675 1680 1685 

tat acc cat egg caa ctt cgt aaa gtt gat gta aac cac gtg att acc 6269 
Tyr Thr His Arg Gin Leu Arg Lys Val Asp Val Asn His Val He Thr 
1690 1695 1700 

gat cag tct tat gat ctt ttg ggt cgc att aca ggg caa att att gat 6317 
Asp Gin Ser Tyr Asp Leu Leu Gly Arg He Thr Gly Gin He He Asp 
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1705 1710 1715 

ccc ggc acg gca aga gaa att aaa cgt aat tac gtt tat caa tat ccc 6365 
Pro Gly Thr Ala Arg Glu lie Lys Arg Asn Tyr Val Tyr Gin Tyr Pro 
1720 1725 1730 1735 

ggc ggt gac gaa aat gat ttt tgg ccg gtg atg ata gaa gtt gat tct 6413 
Gly Gly Asp Glu Asn Asp Phe Trp Pro Val Met lie Glu Val Asp Ser 
1740 1745 1750 

caa ggc gtc aga cgt aaa acc cat tac gat gga atg gga cgt att tgt 6461 
Gin Gly Val Arg Arg Lys Thr His Tyr Asp Gly Met Gly Arg He Cys 
1755 1760 1765 

teg att gaa gaa caa gat gat gat ggc gec tgg ggc aca teg ggg att 6509 
Ser He Glu Glu Gin Asp Asp Asp Gly Ala Trp Gly Thr Ser Gly He 
1770 1775 1780 

tat caa ggc aca tat cga aaa gtt ctt gee aga caa tat gat gtt ttg 6557 
Tyr Gin Gly Thr Tyr Arg Lys Val Leu Ala Arg Gin Tyr Asp Val Leu 
1785 1790 1795 

ggg cag ttg age aag gaa att tea aat gat tgg tta tgg aat tta tct 6605 
Gly Gin Leu Ser Lys Glu He Ser Asn Asp Trp Leu Trp Asn Leu Ser 
1800 1805 1810 1815 

gee aat cct ttg gtt cgt ctt get acc ccg ttg gtt aca acg aaa acc 6653 
Ala Asn Pro Leu Val Arg Leu Ala Thr Pro Leu Val Thr Thr Lys Thr 
1820 1825 1830 

tat aaa tat gat ggt tgg gga aat ctt tac age acg gaa tac agt gat 6701 
Tyr Lys Tyr Asp Gly Trp Gly Asn Leu Tyr Ser Thr Glu Tyr Ser Asp 
1835 1840 1845 

ggt egg ata gag ctg gaa ate cat gat cct att acg agg aca att act 6749 
Gly Arg He Glu Leu Glu He His Asp Pro He Thr Arg Thr He Thr 
1850 1855 1860 

caa ggg gtc aaa gga tta ggg atg tta aat att cag caa aat aat ttt 6797 
Gin Gly Val Lys Gly Leu Gly Met Leu Asn He Gin Gin Asn Asn Phe 
1865 1870 1875 

gag caa ccg get teg ate aaa get gtg tat cct gat ggt acg ata tat 6845 
Glu Gin Pro Ala Ser He Lys Ala Val Tyr Pro Asp Gly Thr He Tyr 
1880 1885 1890 1895 

age acc cgt act tat cgt tat gat gga ttt ggt cgt aca gtg acg gaa 6893 
Ser Thr Arg Thr Tyr Arg Tyr Asp Gly Phe Gly Arg Thr Val Thr Glu. 

1900 1905 1910 

aca gat gca gaa ggt cat get acc caa att gga tat gat gtg ttt gat 6941 
Thr Asp Ala Glu Gly His Ala Thr Gin He Gly Tyr Asp Val Phe Asp 
1915 1920 1925 

cgt ata gtg aaa aaa acg ttg cca gac gga aca ata tta gaa tec get 6989 
Arg He Val Lys Lys Thr Leu Pro Asp Gly Thr lie Leu Glu Ser Ala 
1930 1935 1940 

tat gca age ttt age cat gaa gaa tta att teg gca ctg aac gtg aat 7037 
Tyr Ala Ser Phe Ser His Glu Glu Leu He Ser Ala Leu Asn Val Asn 
1945 1950 1955 

ggc aca cag ttg ggg gca tta gtt tat gat ggt ctt ggg egg gta ata 7085 
Gly Thr Gin Leu Gly Ala Leu Val Tyr Asp Gly Leu Gly Arg Val He 
1960 1965 1970 1975 
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agt gat acg gtg ggt ggt cgc aaa acg gaa tat tta tat ggg cct caa 7133 
Ser Asp Thr Val Gly Gly Arg Lys Thr Glu Tyr Leu Tyr Gly Pro Gin 
1980 1985 1990 



ggt gac aaa ccg att cag tea att act cct teg cat aat aag caa aat 
Gly Asp Lys Pro He Gin Ser lie Thr Pro Ser His Asn Lys Gin Asn 
1995 2000 2005 

atg gat tac etc tac tat ctt ggt agt gtg atg tec aaa ttt ace acg 
Met Asp Tyr Leu Tyr Tyr Leu Gly Ser Val Met Ser Lys Phe Thr Thr 
2010 2015 2020 

ggg aca gac caa caa aac ttt cgt tat cat teg aaa acg gga aca tta 
Gly Thr Asp Gin Gin Asn Phe Arg Tyr His Ser Lys Thr Gly Thr Leu 
2025 2030 2035 



tta tec caa tta gtg aca aaa ate gaa tat gat get ttt gat cga gaa 
Leu Ser Gin Leu Val Thr Lys lie Glu Tyr Asp Ala Phe Asp Arg Glu 
2140 2145 2150 



ggg gtg gtt atg aaa aat gaa cgt tat caa tat gat aat aat caa cgc 
Gly Val Val Met Lys Asn Glu Arg Tyr Gin Tyr Asp Asn Asn Gin Arg 
2185 2190 2195 

tta age caa tac caa tgt gag gga gaa caa tct ccg att gat cat acg 
Leu Ser Gin Tyr Gin Cys Glu Gly Glu Gin Ser Pro He Asp His Thr 
2200 2205 2210 2215 



7181 



7229 



7277 



tta tct gcg tea gaa ggc gta tct cag act aat tac agt tat ttc cca 7325 
Leu Ser Ala Ser Glu Gly Val Ser Gin Thr Asn Tyr Ser Tyr Phe Pro 
2040 2045 2050 2055 

teg ggt gta tta cag cga gaa tea ttt tta egg gat aat aaa ccg att 7373 
Ser Gly Val Leu Gin Arg Glu Ser Phe Leu Arg Asp Asn Lys Pro He 
2060 2065 2070 

tea teg ggc gag tac ctt tat acg atg tec ggt ttg att caa cgt cat 7421 
Ser Ser Gly Glu Tyr Leu Tyr Thr Met Ser Gly Leu He Gin Arg His 
2075 2080 2085 

aaa gat agt ttt ggt cat aat cat gtt tat agt tac gat get cag gga 7469 
Lys Asp Ser Phe Gly His Asn His Val Tyr Ser Tyr Asp Ala Gin Gly 
2090 2095 2100 

aga ttg gtc aaa aca gaa cag gat gca caa tac get aca ttt gaa tat 7517 
Arg Leu Val Lys Thr Glu Gin Asp Ala Gin Tyr Ala Thr Phe Glu Tyr 
2105 2110 2115 

gac aat gtt ggg cga ttg ata aca acg acg ace aaa gac acg acg tea 7565 
Asp Asn Val Gly Arg Leu He Thr Thr Thr Thr Lys Asp Thr Thr Ser 
2120 2125 2130 2135 



7613 



ata aaa cgc teg eta att agt gac ttc tea ata caa gtt att ace tta 7661 
He Lys Arg Ser Leu He Ser Asp Phe Ser He Gin Val He Thr Leu 
2155 2160 2165 

age tat acg aag aat aat caa ate agt caa cgt ate acc tec ate gat 7709 
Ser Tyr Thr Lys Asn Asn Gin He Ser Gin Arg He Thr Ser He Asp 
2170 2175 2180 



7757 



7805 



ggt cgt gta tta aat cag cag att tac cat tat gac caa tgg gga aat 7853 
Gly Arg Val Leu Asn Gin Gin He Tyr His Tyr Asp Gin Trp Gly Asn 
2220 2225 2230 
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att aag egg. etc gat aat aca tat cga gat ggt aag gaa acg gtg gat 7901 
lie Lys Arg Leu Asp Asn Thr Tyr Arg Asp Gly Lys Glu Thr Val Asp 
2235 2240 2245 

tat cat ttc agt caa gee gat cca act caa ctt att cgt att acc age 7949 
Tyr His Phe Ser Gin Ala Asp Pro Thr Gin Leu He Arg He Thr Ser 
2250 2255 2260 

gac aaa cag cag ata gag tta agt tat gat get aat ggc aac eta aca 7997 
Asp Lys Gin Gin He Glu Leu Ser Tyr Asp Ala Asn Gly Asn Leu Ihr 
2265 2270 2275 

cgt gac gaa aaa ggg caa acg etc att tac gat cag aat aat cgc ttg 8045 
Arg Asp Glu Lys Gly Gin Ihr Leu He Tyr Asp Gin Asn Asn Arg Leu 
2280 2285 2290 2295 

gta cag gtc aaa gac egg ttg ggc aat ctg gtg tgc age tac cag tat 8093 
Val Gin Val Lys Asp Arg Leu Gly Asn Leu Val Cys Ser Tyr Gin Tyr 
2300 2305 2310 

gat gca ttg aac aaa tta acc gca cag gtt ttg gcg aat ggt acc gtt 8141 
Asp Ala Leu Asn Lys Leu Thr Ala Gin Val Leu Ala Asn Gly Thr Val 
2315 2320 2325 

aat cga cag cat tat get tec ggt aaa gtg acg aat att caa ttg ggt 8189 
Asn Arg Gin His Tyr Ala Ser Gly Lys Val Thr Asn He Gin Leu Gly 
2330 2335 2340 

gat gaa gcg att act tgg ttg age agt gat aag caa cga att gga cat 8237 
Asp Glu Ala He Thr Trp Leu Ser Ser Asp Lys Gin Arg He Gly His 
2345 2350 2355 

caa age gee aag aat ggt caa tea gtc tac tat caa tat ggt att gac 8285 
Gin Ser Ala Lys Asn Gly Gin Ser Val Tyr Tyr Gin Tyr Gly He Asp 
2360 2365 2370 2375 

cat aac agt acg gtt ate gec agt cag aac gaa aac gag ttg atg get 8333 
His Asn Ser Thr Val He Ala Ser Gin Asn Glu Asn Glu Leu Met Ala 
2380 2385 2390 

tta tec tat aca cct tat ggc ttt agg agt tta att tec tea tta ccg 8381 
Leu Ser Tyr Thr Pro Tyr Gly Phe Arg Ser Leu He Ser Ser Leu Pro 
2395 2400 2405 

ggt ttg aat ggc gca cag gtt gat cca gta aca ggc tgg tac ttc tta 8429 
Gly Leu Asn Gly Ala Gin Val Asp Pro Val Thr Gly Trp Tyr Phe Leu 
2410 2415 2420 

ggt aac gga tat cgt gtt ttc aac ccg gtt etc atg agg ttt cac age 8477 
Gly Asn Gly Tyr Arg Val Phe Asn Pro Val Leu Met Arg Phe His Ser 
2425 2430 2435 

ccc gat agt tgg agt cct ttt ggt egg gga ggg att aac cct tat acc 8525 
Pro Asp Ser Trp Ser Pro Phe Gly Arg Gly Gly He Asn Pro Tyr Thr 
2440 2445 2450 2455 

tat tgc caa ggc gat ccc ata aac egg att gat ctg aac ggt cat ctt 8573 
Tyr Cys Gin Gly Asp Pro He Asn Arg He Asp Leu Asn Gly His Leu 
2460 2465 2470 

agt gee ggc ggg ata tta ggc att gtg eta ggg gca att ggc ate att 8621 
Ser Ala Gly Gly He Leu Gly He Val Leu Gly Ala He Gly He He 
2475 2480 2485 

gtc ggg att gta tea ctg gga gec gga gcg gcg att age gcg ggt etc 8669 
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Val Gly He Val Ser Leu Gly Ala Gly Ala Ala He Ser Ala Gly Leu 
2490 2495 2500 

att get gcg ggg ggc get ttg ggg gcg att get tct ace age gcg ctt 8717 
He Ala Ala Gly Gly Ala Leu Gly Ala He Ala Ser Thr Ser Ala Leu 
2505 2510 2515 



gca gtt act gcg act gtc att gga ttg get gee gat teg ata ggg att 
Ala Val Thr Ala Thr Val He Gly Leu Ala Ala Asp Ser He Gly He 
2520 2525 2530 2535 

gcg tea gca gca tta teg gaa aaa gat ccg aaa aca tct ggg ata tta 
Ala Ser Ala Ala Leu Ser Glu Lys Asp Pro Lys Thr Ser Gly He Leu 
2540 2545 2550 



8765 



8813 



aat tgg att agt gcg gga ttg ggg gtt tta age ttt ggt ate age gca 8861 
Asn Trp He Ser Ala Gly Leu Gly Val Leu Ser Phe Gly lie Ser Ala 
2555 2560 2565 

ata ace ttt ace tct teg ctg gta aaa teg gca egg agt ggt tct cag 8909 
He Thr Phe Thr Ser Ser Leu Val Lys Ser Ala Arg Ser Gly Ser Gin 
2570 2575 2580 

gca gtc age gcg ggt gtt ate ggg tea gtg cct ctt gaa ttt ggt gaa 8957 
Ala Val Ser Ala Gly Val He Gly Ser Val Pro Leu Glu Phe Gly Glu 
2585 " 2590 2595 

gtt get age cgt tec age aga cga tgg gat att gcg tta tct teg ata 9005 
Val Ala Ser Arg Ser Ser Arg Arg Trp Asp He Ala Leu Ser Ser He 
2600 2605 2610 2615 

teg ttg ggc gca aat gcg gcg tct etc tct acg ggg ata gcg gcg gcg 9053 
Ser Leu Gly Ala Asn Ala Ala Ser Leu Ser Thr Gly He Ala Ala Ala 
2'620 2625 2630 

gcg gtt gca gac agt aat gcg aat gca get aat att ctg gga tgg gta 9101 
Ala Val Ala Asp Ser Asn Ala Asn Ala Ala Asn He Leu Gly Trp Val 
2635 2640 2645 

tec ttt ggt ttt ggt gca gta teg aca ace tea gga ata att gag ctt 9149 
Ser Phe Gly Phe Gly Ala Val Ser Ihr Thr Ser Gly He He Glu Leu 
2650 2655 2660 

acg cgt aca get tat gca gtg aat cat cag act tgg gaa ctg agt tea 9197 
Thr Arg Thr Ala Tyr Ala Val Asn His Gin Thr Trp Glu Leu Ser Ser 
2665 2670 2675 

tea gca ggt act teg gag gaa gtg aag cct ata cgt tgt etc gtt tea 9245 
Ser Ala Gly Thr Ser Glu Glu Val Lys Pro He Arg Cys Leu Val Ser 
2680 2685 2690 2695 

cac cgc tgg aat cag aag cag tga atgttaaccc tcctcgggca gttgagttaa 9299 
His Arg Trp Asn Gin Lys Gin 
2700 

tcaaacgttt cgaaatagta cegggaacta tttagccaat cgtccattga aaccegtaat 9359 
gtgttgcgac gtcgtttgac aatataaaga ttctgegaac cgattggtta agtctcacga 9419 
aaaataacta ttaggegaca tttgegtege cttttttaag gaactttatc aggttacatt 9479 
tataagaagc tattttgttt tegaeggatg ttggtttctc tgagataaaa aatagaggga 9539 
aatgatgtca agggtgataa tggttaattg taaaatatgt gatattattc gcatttatat 9599 
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gtcaatgtaa ttcctcttat tatttaattt tattgcattt gctacgcgaa atcgccttat. 9659 
aattttattt ttaataaatt attatttcat cattaaacta aaataaatta tttctaga 9717 



<210> 2 
<211> 417 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 2 

Met Gin Arg Ala Gin Arg Val Val He Thr Gly Met Gly Ala Val Thr 
15 10 15 

Pro He Gly Glu Asp Val Glu Ser Cys Trp Gin Ser He He Glu Lys 
20 25 30 

Gin His Arg Phe His Arg He Glu Phe Pro Asp Ser Phe He Asn Ser 
35 40 45 

Arg Phe Phe Ser Phe Leu Ala Pro Asn Pro Ser Arg Tyr Gin Leu Leu 
50 55 60 

Pro Lys Lys Leu Thr His Thr Leu Ser Asp Cys Gly Lys Ala Ala Leu 
65 70 75 80 

Lys Ala Ihr Tyr Gin Ala Phe Thr Gin Ala Phe Gly Val Asn lie Ser 
85 90 95 

Pro Val Glu Tyr Tyr Asp Lys Tyr Glu Cys Gly Val He Leu Gly Ser 
100 105 110 

Gly Trp Gly Ala He Asp Asn Ala Gly Asp His Ala Cys Gin Tyr Lys 
115 120 125 

Gin Ala Lys Leu Ala His Pro Met Ser Asn Leu He Ihr Met Pro Ser 
130 135 140 

Ser Met Ihr Ala Ala Cys Ser He Met Tyr Gly Leu Arg Gly Tyr Gin 
145 150 155 " 160 

Asn Ihr Val Met Ala Ala Cys Ala Thr Gly Ihr Met Ala He Gly Asp 
165 170 175 

Ala Phe Glu He He Arg Ser Gly Arg Ala Lys Cys Met He Ala Gly 
180 185 190 

Ala Ala Glu Ser Leu Ihr Arg Glu Cys Asn He Trp Ser He Asp Val 
195 200 205 

Leu Asn Ala Leu Ser Lys Glu Gin Ala Asp Pro Asn Leu Ala Cys Cys 
210 215 220 

Pro Phe Ser Leu Asp Arg Ser Gly Phe Val Leu Ala Glu Gly Ala Ala 
225 230 235 240 

Val Val Cys Leu Glu Asn Tyr Asp Ser Ala He Ala Arg Gly Ala Thr 
245 250 255 

He Leu Ala Glu He Lys Gly Tyr Ala Gin Tyr Ser Asp Ala Val Asn 
260 265 270 

Leu Thr Arg Pro Thr Glu Asp He Glu Pro Lys He Leu Ala He Thr 
275 280 285 
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Lys Ala lie Glu Gin Ala Gin He Ser Pro Lys Asp He Asp Tyr He 
290 295 300 

Asn Ala His Gly Thr Ser Thr Pro Leu Asn Asp Leu Tyr Glu Thr Gin 
305 " 310 315 320 

Ala He Lys Ala Ala Leu Gly Gin Tyr Ala Tyr Gin Val Pro He Ser 
325 330 335 

Ser Thr Lys Ser Tyr Thr Gly His Leu He Ala Ala Ala Gly Ser Phe 
340 345 350 

Glu Thr He Val Cys Val Lys Ala Leu Ala Glu Asn Cys Leu Pro Ala 
355 360 365 . 

Thr Leu Asn Leu His Arg Ala Asp Pro Asp Cys Asp Leu Asn Tyr Leu 
370 375 380 

Pro Asn Gin His Cys iyr Thr Ala Gin Pro Glu Val Ihr Leu Asn He 
385 390 395 400 

Ser Ala Gly Phe Gly Gly His Asn Ala Ala Leu Val He Ala Lys Val 
405 410 415 



Arg 



<210> 3 
<211> 253 
<212> EKT 

<213> Photorhabdus luminescens 
<400> 3 

Met Glu Asp He Glu His Trp Ser Asn Phe Ser Gly Asp Phe Asn Pro 
15 10 15 

He His iyr Ser Ala Lys Ser Glu Ser Leu Arg Asn He Gin Gin His 
20 25 30 

Pro Val Gin Gly Met Leu Ser Leu Leu iyr Val Arg Gin Gin Phe Ser 
35 40 45 

Gin Leu lhr Ser Ala Phe Thr Thr Gly He Leu Asn He Asp Ala Ser 
50 55 60 

Phe Arg Gin Tyr Val Tyr Thr Ala Leu Pro His Gin Leu Arg He Asn 
65 70 75 80 

Thr Lys Asn Lys Thr Phe Lys Leu Glu Asn Pro Ser Lys Glu Asn Thr 
85 90 95 

Leu Phe Gly Asn Thr Ser Val Glu Asn Thr Met Glu Ser He Glu Asp 
100 105 110 

Trp He Val Gin Asp Asn Cys Gin Lys Leu Thr He Thr Gly Glu Glu 
115 120 125 

Val Cys Glu Lys Tyr Ala Val Phe Arg Tyr Tyr Phe Pro Ser Val Thr 
130 135 140 

Ser He Gly Trp Phe Leu Asp Ala Leu Ala Phe His Leu He He Asn 
145 ~ 150 155 160 

Ser Thr Gly Phe Leu Asn Phe Glu His Tyr His Phe Asn Gin Leu Gin 



165 



170 



175 
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Asp Tyr Leu Ser Gin Ser Phe Thr Leu His Thr Gly Gin Ala lie Lys 
180 185 190 

lie Arg Lys Glu lie Val Asn Ser Thr Val Leu Leu Ser Ser Pro Asp 
195 200 205 

lie Cys Val Glu Leu Asn Pro Pro Leu Leu lie Lys Asn Gly Asp Lys 
210 215 220 

Asp T>/r He Arg He Phe Tyr Tyr Arg Cys Leu Tyr Asp Lys Lys Pro 
225 230 235 240 

He Phe Val Ser Lys Thr Ser He He Ser lys Met Lys 



<210> 4 
<211> 186 
<212> PRT 

<213> Photorhabdus lurninescens 
<400> 4 

Met Asn Val Leu Glu Gin Gly lys Val Ala Ala Leu Tyr Ser Ala Tyr 
15 10 15 

Ser Glu Thr Glu Gly Ser Ser Trp Val Gly Asn Leu Cys Cys Phe Ser 
20 25 30 

Ser Asp Arg Glu His Leu Pro He He Val Asn Gly Arg Arg Phe Leu 
35 40 45 

He Glu Phe Val He Pro Asp His Leu Leu Asp lys Thr Val Lys Pro 
50 55 60 

Arg Val Phe Asp Leu Asp He Asn Lys Gin Phe Leu Leu Arg Arg Asp 
65 70 75 80 

His Arg Glu He Asn He Tyr Leu Leu Gly Glu Gly Asn Phe Met Asp 
85 90 95 

Arg Thr Thr Thr Asp Lys Asn Leu Phe Glu Leu Asn Glu Asp Gly Ser 
100 105 110 

Leu Phe He Lys Thr Leu Arg His Ala Leu Gly Lys Tyr Val Ala He 
115 120 125 

Asn Pro Ser Thr Thr Gin Phe He Phe Phe Ala Gin Gly Lys Tyr Ser 
130 135 140 

Glu Phe He Met Asn Ala Leu Lys Thr Val Glu Asp Glu Leu Ser Lys 
145 150 155 160 

Arg Tyr Arg Val Arg He He Pro Glu Leu Gin Gly Pro Tyr Tyr Gly 
165 170 175 

Phe Glu. Leu Asp He Leu Ser lie Thr Ala 



245 



250 



180 



185 



<210> 5 
<211> 258 
<212> PRT 

<213> Photorhabdus lurninescens 
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<400> 5 



Met Glu Lys Lys He Thr Thr Phe Thr He Glu Lys Thr Asp 
15 10 

Asp Asn Phe Tyr Ala Asn Gly Arg His Gin cys Met Val Lys He Ser 
15 ~ 20 25 30 

Val Leu Lys Gin Glu Tyr Arg Asn Gly Asp Trp He Lys Leu Ala Leu 
35 40 45 

Ser Glu Ala Glu Lys Arg Ser He Gin Val Ala Ala Leu Ser Asp Ser 
50 55 60 

Leu He Tyr Asp Gin Leu Lys Met Pro Ser Gly Trp Thr Thr Thr Asp 
65 70 75 

Ala Arg Asn Lys Phe Asp Leu Gly Leu Leu Asn Gly Val Tyr His Ala 
80 85 90 

Asp Ala Phe He Asp Glu Gin Val Thr Asp Arg Ala Gly Asp Cys Cys 
95 100 105 110 

Ihr Asn Glu Asn Tyr Gin Asn Ser Val Lys Ser Val Pro Glu He He 
115 120 125 

Tyr Arg Tyr Val Ser Ser Asn Arg Thr Ser Thr Glu Tyr Leu Met Ala 
130 135 140 

Lys Met Thr Phe Glu Asp Thr Asp Gly Lys Arg Ihr Leu Thr Ihr Asn 
145 150 155 

Met Ser Val Gly Asp Glu Val Phe Asp Ser Lys Val Leu Leu Lys Ala 
160 " 165 170 

He Ala Pro Tyr Ala lie Asn Ihr Asn Gin Leu His Glu Asn He Asn 
175 180 185 190 

Ihr Leu Phe Asp Lys Ihr Glu Glu Pro Ihr Lys Ser Asp Thr His His 
195 200 205 

Gin He He Asn Leu Tyr Arg Trp Thr Leu Pro Tyr His Leu Arg He 
210 215 220 

Leu Glu Gly Asn Asp Ser Thr Val Asn Arg He Tyr Val Leu Gly Lys 
225 230 235 

Glu Pro Ser Asn Asp Arg Phe Leu Thr Arg Gly Arg Val Phe Lys Arg 
240 245 250 

Gly Thr His Met 
255 



<210> 6 
<211> 1584 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 6 

Met Lys Ala Thr Asp He Tyr Ser Asn Ala Phe Asn Phe Gly Ser Tyr 
15 10 15 



He Asn Thr Gly Val Asp Pro Arg Thr Gly Gin Tyr Ser Ala Asn He 
20 25 30 
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Asn He He Thr Leu Arg Pro Asn Asn Val Gly Asn Ser Glu Gin Thr 
35 40 45 

Leu Ser Leu Ser Phe Ser Pro Leu Thr Thr Leu Asn Asn Gly Phe Gly 
50 55 60 

He Gly Trp Arg Phe Ser Leu Thr Thr Leu Asp He Lys Thr Leu Thr 
65 70 75 80 

Phe Ser Arg Ala Asn Gly Glu Gin Phe Lys Cys Lys Pro Leu Pro Pro 
85 90 95 

Asn Asn Asn Asp Leu Ser Phe Lys Asp Lys Lys Leu Lys Asp Leu Arg 
100 105 " 110 

Val Tyr Lys Leu Asp Ser Asn Thr Phe Tyr Val Tyr Asn Lys Asn Gly 
115 120 125 

He He Glu He Leu Lys Arg He Gly Ser Ser Asp He Ala Lys Thr 
130 135 140 

Val Ala Leu Glu Phe Pro Asp Gly Glu Ala Hie Asp Leu He Tyr Asn 
145 150 155 160 

Ser Arg Phe Ala Leu Ser Glu He Lys Tyr Arg Val Thr Gly Lys Thr 
165 170 175 

Tyr Leu Lys Leu Asn Tyr Ser Gly Asn Asn Cys Thr Ser Val Glu Tyr 
180 185 190 

Pro Asp Asp Asn Asn lie Ser Ala Lys He Ala Phe Asp Tyr Arg Asn 
195 200 205 

Asp Tyr Leu He Thr Val Thr Val Pro Tyr Asp Ala Ser Gly Pro He 
210 215 220 

Asp Ser Ala Arg Phe Lys Met Thr Tyr Gin Thr Leu Lys Gly Val Phe 
225 230 235 240 

Pro Val He Ser Thr Phe Arg Thr Pro Thr Gly Tyr Val Glu Leu Val 
245 250 255 

Ser Tyr Lys Glu Asn Gly His Lys Val Thr Asp Thr Glu Tyr He Pro 
260 265 270 

Tyr Ala Ala Ala Leu Thr He Gin Pro Gly Asn Gly Gin Pro Ala Val 
275 280 285 

Ser Lys Ser Tyr Glu Tyr Ser Ser Val His Asn Phe Leu Gly Tyr Ser 
290 295 300 

Ser Gly Arg Thr Ser Phe Asp Ser Ser Gin Asp Asn Leu Tyr Leu Val 
305 310 315 320 

Thr Gly Lys Tyr Thr Tyr Ser Ser He Glu Arg Val Leu Asp Gly Gin 
325 330 335 

Ser Val Val Ser Val He Glu Arg Val Phe Asn Lys the His Leu Met 
340 345 350 

Thr Lys Glu Ala Lys Thr Gin Asp Asn Lys Arg He Thr Thr Glu He 
355 360 365 

Thr TVr Asn Glu Asp Leu Ser Lys Ser Phe Ser Glu Gin Pro Glu Asn 
370 375 380 
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Leu Gin Gin Pro Ser Arg Val Leu Thr Arg Tyr Thr Asp lie Gin Thr 
385 390 395 400 

Asn Thr Ser Arg Glu Glu Thr Val Asn He Lys Ser Asp Asp Trp Gly 
405 410 415 

Asn Thr Leu Leu He Thr Glu Ihr Ser Gly He Gin Lys Glu Tyr Val 
420 425 430 

Tyr Tyr Pro Val Asn Gly Glu Gly Asn Ser Cys Pro Ala Asp Pro Leu 
435 440 445 

Gly Phe Ser Arg Phe Leu Lys Ser Val Ihr Gin Lys Gly Ser Pro Asp 
450 455 460 

Ala Ala Gin Ser Val Ala Asn Lys Val He His Tyr Ihr Tyr Gin Lys 
465 470 475 480 

Phe Pro Thr Phe Ihr Gly Ala iyr Val Lys Glu Tyr Val Ser Lys Val 
485 490 495 

Ser Glu Thr He Asp Asn Lys He Ala Arg Ihr Phe Ser Tyr Val Asn 



Met Asn Asn Gin Gin Thr Val Ihr Thr Phe Lys Tyr Glu Tyr Ser Glu 
530 535 540 

Ser Glu Met Thr Thr Asn Ala Thr Val Thr Gly Phe Asp Gly Ala His 
545 550 555 560 

Met Glu Ser Lys Asn Val Thr Ser He Tyr Thr His Arg Gin Leu Arg 
565 570 575 

Lys Val Asp Val Asn His Val He Thr Asp Gin Ser Tyr Asp Leu Leu 
580 585 590 

Gly Arg He Thr Gly Gin He He Asp Pro Gly Thr Ala Arg Glu He 
595 600 605 

Lys Arg Asn Tyr Val Tyr Gin Tyr Pro Gly Gly Asp Glu Asn Asp Phe 
610 615 620 

Trp Pro Val Met He Glu Val Asp Ser Gin Gly Val Arg Arg Lys Thr 
625 630 635 640 

His Tyr Asp Gly Met Gly Arg He Cys Ser He Glu Glu Gin Asp Asp 
645 650 655 

Asp Gly Ala Trp Gly Thr Ser Gly He Tyr Gin Gly Thr Tyr Arg Lys 
660 665 670 

Val Leu Ala Arg Gin Tyr Asp Val Leu Gly Gin Leu Ser Lys Glu He 
675 680 685 

Ser Asn Asp Trp Leu Trp Asn Leu Ser Ala Asn Pro Leu Val Arg Leu 
690 695 700 

Ala Thr Pro Leu Val Thr Thr Lys Thr Tyr Lys Tyr Asp Gly Trp Gly 
705 710 715 720 

Asn Leu Tyr Ser Thr Glu Tyr Ser Asp Gly Arg He Glu Leu Glu He 



500 



505 



510 



Ser Pro Thr Ser Lys Ser His Gly Ser Leu Ala Lys He Thr Ser Val 
515 520 525 
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725 



730 



735 



His Asp Pro He Thr Arg Thr He Thr Gin Gly Val Lys Gly Leu Gly 
740 745 750 

Met Leu Asn He Gin Gin Asn Asn Phe Glu Gin Pro Ala Ser He Lys 
755 760 765 

Ala Val Tyr Pro Asp Gly Thr He Tyr Ser Thr Arg Thr Tyr Arg Tyr 
770 775 780 

Asp Gly Phe Gly Arg Thr Val Thr Glu Thr Asp Ala Glu Gly His Ala 
785 790 795 800 

Thr Gin He Gly Tyr Asp Val Phe Asp Arg He Val Lys Lys Thr Leu 
805 810 815 

Pro Asp Gly Thr He Leu Glu Ser Ala Tyr Ala Ser Phe Ser His Glu 
820 825 830 

Glu Leu He Ser Ala Leu Asn Val Asn Gly Thr Gin Leu Gly Ala Leu 
835 840 845 

Val Tyr Asp Gly Leu Gly Arg Val He Ser Asp Thr Val Gly Gly Arg 
850 855 860 

Lys Thr Glu Tyr Leu Tyr" Gly Pro Gin Gly Asp Lys Pro He Gin Ser 
865 870 875 880 

He Thr Pro Ser His Asn Lys Gin Asn Met Asp Tyr Leu Tyr Tyr Leu 
885 890 895 

Gly Ser Val Met Ser Lys Phe Thr Thr Gly Thr Asp Gin Gin Asn Phe 
' 900 905 910 

Arg Tyr His Ser Lys Thr Gly Thr Leu Leu Ser Ala Ser Glu Gly Val 
915 920 925 

Ser Gin Thr Asn Tyr Ser Tyr Phe Pro Ser Gly Val Leu Gin Arg Glu 
930 935 940 

Ser Phe Leu Arg Asp Asn Lys Pro He Ser Ser Gly Glu Tyr Leu Tyr 
945 950 955 960 

Thr Met Ser Gly Leu He Gin Arg His Lys Asp Ser Phe Gly His Asn 
965 970 975 

His Val Tyr Ser Tyr Asp Ala Gin Gly Arg Leu Val Lys Thr Glu Gin 



Thr Thr Thr Thr Lys Asp Thr Thr Ser Leu Ser Gin Leu Val Thr Lys 
1010 1015 1020 

He Glu Tyr Asp Ala Phe Asp Arg Glu lie Lys Arg Ser Leu He Ser 
025 1030 1035 1040 

Asp Phe Ser He Gin Val He Thr Leu Ser Tyr Thr Lys Asn Asn Gin 
1045 1050 1055 

lie Ser Gin Arg He Thr Ser He Asp Gly Val Val Met Lys Asn Glu 



980 



985 



990 



Asp Ala Gin Tyr Ala Thr Phe Glu Tyr Asp Asn Val Gly Arg Leu He 
995 1000 1005 



1060 



1065 



1070 
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Arg Tyr Gin Tyr Asp Asn Asn Gin Arg Leu Ser Gin Tyr Gin Cys Glu 
1075 1080 1085 

Gly Glu Gin Ser Pro lie Asp His Thr Gly Arg Val Leu Asn Gin Gin 
1090 1095 1100 

He Tyr His Tyr Asp Gin Trp Gly Asn He Lys Arg Leu Asp Asn Thr 
105 1110 1115 1120 

Tyr Arg Asp Gly Lys Glu Thr Val Asp Tyr His Phe Ser Gin Ala Asp 
1125 1130 1135 

Pro Thr Gin Leu He Arg He Thr Ser Asp Lys Gin Gin He Glu Leu 
1140 1145 1150 

Ser Tyr Asp Ala Asn Gly Asn Leu Thr Arg Asp Glu Lys Gly Gin Thr 
1155 1160 1165 

Leu He Tyr Asp Gin Asn Asn Arg Leu Val Gin Val Lys Asp Arg Leu 
1170 1175 1180 

Gly Asn Leu Val Cys Ser Tyr Gin Tyr Asp Ala Leu Asn Lys Leu Thr 
185 1190 1195 1200 

Ala Gin Val Leu Ala Asn Gly Thr Val Asn Arg Gin His Tyr Ala Ser 
1205 1210 1215 

Gly Lys Val Thr Asn He Gin Leu Gly Asp Glu Ala He Thr Trp Leu 
1220 1225 1230 

Ser Ser Asp Lys Gin Arg He Gly His Gin Ser Ala Lys Asn Gly Gin 
1235 1240 1245 

Ser Val Tyr Tyr Gin Tyr Gly He Asp His Asn Ser Thr Val He Ala 
1250 1255 1260 

Ser Gin Asn Glu Asn Glu Leu Met Ala Leu Ser Tyr Thr Pro Tyr Gly 
265 1270 1275 1280 

Phe Arg Ser Leu He Ser Ser Leu Pro Gly Leu Asn Gly Ala Gin Val 
1285 1290 1295 

Asp Pro Val Thr Gly Trp Tyr Phe Leu Gly Asn Gly Tyr Arg Val Phe 
1300 1305 1310 

Asn Pro Val Leu Met Arg Phe His Ser Pro Asp Ser Trp Ser Pro Phe 
1315 1320 1325 

Gly Arg Gly Gly He Asn Pro Tyr Thr Tyr Cys Gin Gly Asp Pro He 
1330 1335 1340 

Asn Arg He Asp Leu Asn Gly His Leu Ser Ala Gly Gly He Leu Gly 
345 1350 1355 1360 

He Val Leu Gly Ala He Gly He lie Val Gly He Val Ser Leu Gly 
1365 1370 1375 

Ala Gly Ala Ala He Ser Ala Gly Leu He Ala Ala Gly Gly Ala Leu 
1380 1385 - 1390 

Gly Ala He Ala Ser Thr Ser Ala Leu Ala Val Thr Ala Thr Val He 
1395 1400 1405 



Gly Leu Ala Ala Asp Ser He Gly He Ala Ser Ala Ala Leu Ser Glu 
1410 1415 1420 
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Lys Asp Pro Lys Thr Ser Gly He Leu Asn Trp He Ser Ala Gly Leu 
425 1430 1435 1440 

Gly Val Leu Ser Phe Gly He Ser Ala He Ihr Phe Thr Ser Ser Leu 
1445 1450 1455 

Val Lys Ser Ala Arg Ser Gly Ser Gin Ala Val Ser Ala Gly Val He 
1460 1465 1470 

Gly Ser Val Pro Leu Glu Phe Gly Glu Val Ala Ser Arg Ser Ser Arg 
1475 1480 1485 

Arg Trp Asp He Ala Leu Ser Ser He Ser Leu Gly Ala Asn Ala Ala 
1490 1495 1500 

Ser Leu Ser Thr Gly He Ala Ala Ala Ala Val Ala Asp Ser Asn Ala 
505 1510 1515 1520 

Asn Ala Ala Asn He Leu Gly Trp Val Ser Phe Gly Phe Gly Ala Val 
1525 1530 1535 

Ser Thr Thr Ser Gly He He Glu Leu Thr Arg Thr Ala Tyr Ala Val 
1540 1545 1550 

Asn His Gin Thr Trp Glu Leu Ser Ser Ser Ala Gly Thr Ser Glu Glu 
1555 1560 1565 

Val Lys Pro He Arg Cys Leu Val Ser His Arg Trp Asn Gin Lys Gin 
1570 1575 1580 



<210> 7 
<211> 18 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence ; oligonucleotide 
<400> 7 

acacagcagg ttcgtcag 18 



<210> 8 
<211> 18 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 8 

ggcagaagca ctcaactc 18 



<210> 9 
<211> 20 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 



<400> 9 
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attgatagca cgcggcgacc 



<210> 10 
<211> 22 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 10 

ttgtaacgtg gagccgaact gg 22 



<210> 11 
<211> 37948 
<212> ENA 

<213> Photorhabdus lurainescens 

<220> 
<221> CDS 

<222> (15171).. (18035) 
<223> orf5 

<220> 
<221> CDS 

<222> (23768).. (31336) 
<223> hph2 

<220> 
<221> CDS 

<222> (31393).. (35838) 
<223> orf2 

<400> 11 

tgttgctgga ccgtggagat tatgcctatc gtcagttaga acgagacacg ctcaatgaag 60 
ccaagatgtg gtatatgcaa gcactgcatc tgttaggcga taaacctcat ctatcgttca 120 
gttcagagtg gagcaaaccg agtttaggcg acgctgccgg aacggaaaga caaaagcaac 180 
acgcccaagc aatggccgct ctgogacaag gtgatgttag tcggcacaac aacccgacag 240 
atcttttctt gccacaggtc aatgaagtga tgcaaaacta ttggcaaaaa ttggaacaac 300 
ggctgtataa cctgcgtcat aacctcacta ttgacggcca acogctacat ctgcctattt 360 
acgctacacc ggcagatcca aaagcattac ttagcgccgc tgtcgctagc tccgaaggtg 420 
gggtagctct ctcacagcca tttatgtcac tgtggcgttt cccacacatg ctggaaaacg 480 
cgcgtggtat ggtcagtcag ctcactcaat tcggctctac gctacaaaat attatcgaac 540 
gtcaggatgc ggaagcttta aacacgctct tgcagaatca agcagcagaa ctgatattga 600 
ctcatctcag catacaggac aaaaccatcg cagagctgga tgcggaaaaa atcgtactgg 660 
aaaaatccaa agccggggog caatcacgct ttgacagcta caaaaagtta tacgacgaaa 720 
atatcaatgc gggtgaaaac cgggctatag cattgcatgc ctccgttgct. ggcctcagca 780 
ctgccctgca agcatcacgt ctggcgggcg ctgcgcttga tctggcgccc aacattttcg 840 
gtctcgctga tggcggtagc cgttggggag cgattgccga agcgacaggt aatgttatgg 900 
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aattctccgc 


cagtgtgatg 


aacaccgaag 


cggataaaat cagccagtca gaagcctatc 


960 


gccgtcgccg 


tcaggaatgg 


gaaatccagc 


gtaatcatgc cgaagcagag ataaaacaga 


1020 


tcgatgctca 


acttcaatca 


ctggcagtac 


gccgtgaagc cgcggtattg cagaaagcca 


1080 


gcctaaaaac 


ccaacaggaa 


cagactcatg 


ctcaattgac tttcctgcaa cgtaaattca 


1140 


gtaatcaagc 


gttgtactac 


tggctacgcg 


gtcggctagc tgctatttac ttccaatttt 


1200 


acgatttggc 


cgtagcgcgt 


tgtctgatgg 


ctgaaatggc ttatcgttgg gagactaatg 


1260 


agaccgcggc 


aagctttatc 


aaacccggcg 


cctggcaggg aacccatgcg ggtttactgg 


1320 


ctggtgaaac 


cttgatgctg 


aatctggcgc 


aaatggaaga tgcocatttg aggtgggatc 


1380 


aacgogctct 


ggaagtggaa 


cggaccattt 


cattgacgca acactatgga gcactgccag 


1440 


aaaaatcgtt 


taatttagcc 


acacggattt 


ctaccctgct agcaggtggt acaactgact 


1500 


ccattgatga 


tcatcccgtt 


acat-tagaaa 


acgaccaact tagtgccaaa atctctctgt 


1560 


caggtctgtx 


attagataat 


gactacccag 


atggcaacgg cgtaggcaac attcgacgca 


1620 


ttaaacaaat 


cagtgtcacc 


ttgccagccc 


tgttaggacc atatcaggat gtacaagcta 


1680 


ttctgtccta 


cggaggaagt 


gaaatcggat 


tagctgaaag ctgtaaatca ctggcgatct 


1740 


ctcatgggat 


caatgacagt 


ggtcaattcc 


agttggattt taacaatggt aagttcctgc 


1800 


cgtttgaagg 


gattgcgatt 


aacgatactg 


gcacattgac actcagtttc coccaatgcg 


1860 


actgtcaaac 


aagaaaacat 


gttgcagact 


ttgagtgata ttattctgca tattcgctat 


1920 


accatccgcc 


aataaocacc 


tcaattaaat 


accaaaaaca ggctcctaaa cggggcctga 


1980 


acttttcacg 


aatatatacc 


actcacagtc 


tgctctcttt acctgtctga cgctcgttat 


2040 


aacagagata 


tttccttttc 


tcgtgagtcc 


catcacctac tataaaatat caaccctctt 


2100 


ctttttcata 


atatgcaata 


tgtaacaaat 


gcaattattt catttagtta ttgttaacta 


2160 


gttatattac 


ttatgatgta 


attataaatt 


ttgttattgc atcacaatag ccatttaaat 


2220 


aaataataac 


gttgtgaaat 


agttgatagt 


taaatggtgt ttttatttag ccgttatttt 


2280 


caacccaatt 


tcagaccgct 


atcagacgtt 


acctgtgttg cctttgtttt gatagatata 


2340 


aataacctta 


tttatatcca 


cggtactcag 


accagcataa atottttatt tacctaacat 


2400 


ttaaaaggaa 


taaacatgaa 


cacactcaaa 


tccgaatatg aaaacgcgtt agtagcaggt 


2460 


tttaataatc 


taaccgatat 


ttgtcatctc 


tcttttgacg aacttcgcaa aaaagtgaag 


2520 


gacaaactct 


catggtcaca 


gacccaaagc 


ttatatcttg aagcacagca agtgcaaaag 


2580 


gacaatctcc 


tgcatgaagc 


ccatattctg 


aaacgcgcca atcctcattt acaaagtgcg 


2640 


gtccatcttg 


ccctgacaac 


acctcatgct 


gaccagcaag gttataatag cagatttggc 


2700 


aatogcgcca 


gcaaatatac 




gcaatttctt ccatgttttc tcctgcggct 


2760 


tatttagctg 


aactttatcg 


tcaggcacgg 


aatttacatg atgaaaattc tatttatcat 


2820 
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ttggatacac ggcgtccgga tctaaaatca 
gagatttcca cactttctct gtctaatgac 
aaggacaagc tgctggaggc tctgaagaat 
gaggctttga agaatattaa acctctgtcc 
attaaacctc tgtccaagga cgatctgctg 
aaggacgatc tgctggaggc tttgaagaat 
gaggctttga agaatattaa acctctgtcc 
attaaacctc tgtccaagga cgatctgctg 
aaggacgatc tacaggaatg tattgaaatt 
atgaaagcgt tatccaattt ccgcgtttct 
agcgtgcgta aggttgttca attacaggct 
ctagaaacga ctatcgatga actaaaatat 
gtctcgccta aattatttac tatcttgact 
ctctatgagg aaaattttgg taatattaaa 
aaaagttatt acaatctgag tgatgaagag 
cttcttccag aagaagaaat agcaattact 
cagactatcg aaaatcctga ttatcgtgct 
ttcaaagcta taaacttatc accgacggta 
gaattcaata tcaataaaga aatattacaa 
cgttatggta ttgacactga gactgcatta 
atcaatgacg gacatctaag tcagtttgat 
caagattttt ccgtcaatgg tactcagaat 
tggaataaaa cagtacttaa acgtgctttt 
ctaaaaatta ccaatccggt caatactacc 
tctcatctct ataggacaaa attactggca 
gggttactgt tggaagccat aggtaaagga 
aatctggtta ctctaattaa caaactctat 
tggagtgtct atcagttgtt tatgatgact 
gaaatacgga atttactgga taccgtctac 
ttgaaagctg aagaagatct agagaaaacc 
ctggaaaaat tcccggaaaa ccagccacaa 
agactgaata aagctgaaga gacccacgaa 
aaaaatctgc caaaagccat atctccttat 



ttggtgctca 


gccagaaaaa 


tatggatacg 2880 


atgttgctag 


agggtattaa 


aactctgttc 2940 


attaaatctc 


tgtccaagga 


cgagctgctg 3000 


aaggacgatc 


tgctggaggc 


tttgaagaat 3060 


gaggctttga 


agaatattaa 


acctctgtcc 3120 


attaaacctc 


tgtccaagga 


cgatctgctg 3180 


aaggacgatc 


tgctggaggc 


tttgaagaat 3240 


gaggctttga 


agaatattaa 


acctctgtcc 3300 


ctattcaatc 


tggacagcca 


cactaaagta 3360 


ggcatgatgc 


catatcacga 


tgcttatgaa 3420 


ccggtgtttg 


aacacgttgt 


tagtacatca 3480 


caagcttctt 


tgttggaaat 


taattcttct 3540 


gaagaaatta 


ctacaatcaa 


tgcaagaagt 3600 


ccttctctaa 


taggaaaacc 


ggaatatctg 3660 


tttagcgatt 


tcattaaaat 


aagaactata 3720 


gatcttgcat 


cgcgtactac 


tagtacacaa 3780 


ctattgaaaa 


ttaataagtt 


tattcgtcta 3840 


ttaagtggaa 


tcctccgcag 


catcagcaca 3900 


aaaatctttc 


gtgttaaata 


ctatatgcaa 3960 


atactatgca 


aggtaccaat 


ttcacaatat 4020 


cgtttattta 


attcccccaa 


actgaatggt 4080 


attgatttaa 


ccctaagcag 


taccaacaac 4140 


aacctcgatg 


atatctcatt 


aaatcgacta 4200 


gaaatgataa 


ctaatgatat 


agagaatctt 4260 


gatatccatc 


agttaactat 


tgatgaactg 4320 


acaaccaatt 


tatctgagat 


tactcctgac 4380 


gctgtcacta 


gctggctacg 


tacacaaaag 4440 


actgataaat 


ataacaaaac 


cctaaccccg 4500 


aatggcttgc 


aagattttaa 


taagaagatg 4560 


aaaaagaaat 


tgcagagcgc 


caaggaaaat 4620 


ctccaagaag 


acaggaaaaa 


agcccagaga 4680 


aaagccgaga 


aaaacctaga 


tgaggtcagg 4740 




ctctgcaatt 


accatctgaa 4800 
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CatgCy"y*Cat 


attccatact 


catcfcgggca 


gataatctgg 




4860 


acagcggaaa 


aattatggaa 


ctiQcrttcicxjcj 


aaaaatcccg 


ttacauctca acctaaattc 


4920 


CaaaaacaaQ 


ctgaacctgt 


ggtccagtat 


fcCCCaOCCTCC 

*»3 * M ** M **- l4 y *-*y w V» 






taccgttcta 


ccggccttaa 
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5040 


tttgttatta 


aaaccaaagc 


acccgaaaca 
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acactaatgt 


cactaacgcg 


tfcfctactgac 


tacrcrttaact 
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m fin 
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aatcttgata 


aaaatctact 


ggagcaagcc 
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tggccatcta 


tcgacaacct 




attaacatct 


caccrtcaat"t" nasrai-r't-r't- 








gacfccaagta 


ttgaccgcag 






caatgqqaaa 


acgccgcfcgc 


gatattaacc 


y u.v_y yy \_ uyy 






Ctacafcgcgt 


ttctxroatcra 




y v— i^y \_.y >— Lua 


y\ -civ. q i_cn^ uci Ual^uucLuuCl. 




cataaccaaa 


aagatcgaga 




catacagtaa 


t"t~aaaAar*rTr f~r-ra t-rrat-r- t-:a 


DDOU 


tatcaatacc 


tattgatcga 


taaccaagtfc 


fcCCOCCOaca 

v*^^»y w wy 


nxxanau w*l a.yciyci.LAjy L. L. 




craaactatca 


ctagtatcca 


aetata tatt 


fl^fr'S ^r 1 "/'Iff if 

oouiaj ^.y i—y i_ 


i. ynnnactt.ciu yyciyyyciyau 


D /uu 


actgtcacaa 


gtgtcaccag 




ttcaeeaaf*t* 


yyyaLfldaia Caclt-aaacgc 


D /ou 


tacagcactt 


aaaccaatat 






^^y^yclcll-Uci CclUCyaLCCg 


DOZU 


acgctzacg"&a 


t tCHOfl l' ^OSC 


aaaaatigatjg 


yen— ci\_y u. L.y i— 


Lgt/iatocau cagocaaagc 


CQQA 


caafctaaata 


tcgataccgt 


agaagatgee 


tttaaatctt 






gtggctaatc 


tggaaatcct 


cagcgcctac 


catgacaaca 


t taa taa tcra t raaana f t-s 


ouuu 


acttacttta 


tcggacgtag 




gtgaatcaat 


MbM^wM^vy t*y L»y L.yyciL> 


6060 


cacaataaat 




taaattcccc 


getaatgect 


uuctuLaaata atacaaaat-t 




gattgtccaa 


ttaaccccta 


caaagatact 


attcacccgg 


taattttcca atcto»Yto 


61 RO 

OiOU 


tatcttatct 


ggctggaaca 


aaaaaaggcg 


aefcaaacagg 


aaquLuataa aaccaccf.frcr 


6240 


yy uuacuauu 


atgaactgaa 


attagegcat 


atccgttatg 


acggcacctg gaatacacca 


6300 


gtcacctttg 


atgtaaacca 


aaaaatatcc 


gatttaaatc 


tgggaaataa aacacctgga 


6360 


ctttactgct 


caagctttca 


aggcagagat 


gaattgctgg 


tgatgtttta taaaaaacaa 


6420 


gatcaattaa 


atcaatacac 


aaacacagta 


ccaataaaag 


gactatatat cacttccaat 


6480 


atgtcttcta 


aggaaatgac 


acctgaaaat 


cacaaaccta 


aegcttataa acagtttgat 


6540 


actaatagta 


ttattggtgt 


caataatege 


tatgeagaaa 


gctacgaaat cccttcatca 


6600 


gtaaatagta 


ataacggtta 


tgattgggga 


gatggctatc 


tgagtatggt gtatggcgga 


6660 


aatatttcag 


ccatcaaact 


ggagtcctca 


tcagataagt 


taaaactctc accaaggtta 


6720 



WO 99/42589 PCT/EP99/01015 

-26- 



agaattattc ataatggact tgtaggcoga caacgcaacc aatgcaacct gatgaagaaa 6780 
tacggtcagc ttggtgataa atttattatt tatactactc taggtattaa ccccaataat 6840 
ttgtcgaata aaaaattcat ttaccctgtt tatcagtata gtgggaacac taccaataat 6900 
gagaaaggac gtctgctgtt ttatcgagaa agtactacta actttgtaag agcctggttc 6960 
cctaaccttc cctctggctc tcaagaaatg tccacaacca ctggcggtga cattagtggt 7020 
aactatggtt atattgataa caaacatagt gacgatgttc catttaaaca atatttctat 7080 
atggatgacc acggtggtat tgacactgat gtttcaggga tattatctat taatacgaac 7140 
attaatcatt caaaagttaa agtaatagtg aaagccgaag gtatcacaga gcaaactttt 7200 
gtagcgagcg aaaacagtaa tgtccccacc aatccgtccc gcttcgaaga aatgaattat 7260 
cagtttaaag agcttgaaat agatatctcc acactgacat ttcataataa tgaagcaagt 7320 
attgatatca cctttatcgc atttgctgag aaatttgacg ataatagtaa tgatcgtaac 7380 
ttaggcgaag aacatttcag tattcgtatt atcaaaaaag cggaaactga taatgccctg 7440 
accctgcacc ataatgcaaa cggggcgcaa tatatgcagt ggggaaactc ttgtattcgc 7500 
ct ta atacgc tatttgcccg tcaattaatt agccgagcca acgcggggat agatactatt 7560 
ttgagcatgg acactcagaa tattcaggaa cctaaattag gagaagattc tcctgatgct 7620 
atggaaccaa tggacttcaa cggcgccaac agcctctatt tctgggaact gttctactac 7680 
accccgatgc tgattgctca acgtttgctg cacgaacaaa acttcgatga ggctaaccgt 7740 
tggctgaaat atgtctggaa cccatccggt tatattgtca atggtcaaat gcaacattac 7800 
cgctggaatg ttcgcccatt acaagaagac actagttgga aogatgatcc gttggattca 7860 
tttgatcctg ataccatagc tcaacatgat ccaatgcact acaaagtcgc cacctttatg 7920 
cgcaccctag atctgttgat cgaacgggga gattacgcct atcgccaatt ggagcgggac 7980 
acactcgctg aagccaaaat gtggtatatg caggcactgc atctattggg tgataaacct 8040 
catctaccac tcagttcagc atggaatgat ccagagctag aagaggccgc agctcttgaa 8100 
aaacaacagg cacatgccaa agaaatagca gatttaogac aaggacttcc tacatccaca 8160 
gggtctaaag atgaaatcaa aacagatctt ttcctgccgc aagtcaacga agtgatgctg 8220 
agctactggc agaaactaga acaacggttg tataacctgc gccataacct ctctattgat 8280 
ggtcaacctt tacatttgcc tattttcgca acaccagcag atccaaaagc gctgctcagc 8340 
gccgctgtcg ccagttcaca aggtggaagt aatcttccat cagaatttat atcagtgtgg 8400 
cgtttccctc atatgctgga aaacgcccgt agtatggtca gtcagctaac ccaattcggc 8460 
tccacattgc aaaatattat cgaacgtcaa gatgcggagg cattaaacac gctgttgcaa 8520 
aatcaggcgg cagaactgat attgaccaat ctcagcatac aggacaaaac catccaagag 8580 
ctggatgctg aaaaaactgt gctagaaaaa aaccgcgccg gaacccagtc gcgttttgat 8640 
agctacagca aattctacga tgaagacatc aacgcgggcg aaaaacaggc aatggcgttg 8700 
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cgtgcttccg tcgctggcat ctctacagcc cttcaagcat cacatctggc gggcgcagca 8760 
cttgatctgg cgcccaacat cttcggcttc gctgatggtg gcagccgttg gggggogatc 8820 
gcccaagcca caggtaatgt catggagttc tccgccagtg ttatgaacac cgaagcggat 8880 
aaaatcagcc aatctgaagc ctaccgtcgg cgtcgtcagg aatgggaaat tcagcgtaat 8940 
aacgccgagg cagagctgaa acaaatcgat gctcaacttg gttcgctggc agtgcgccgt 9000 
gaagccgcag tattgcagaa aaccagccta aaaacccaac aagagcagac tcatgcacaa 9060 
ctgaccttcc tgcaacataa gttcagtaat caggcgctgt acaactggct gcgtggtcga 9120 
ttgtccgcca tttacttcca gttctatgat ttaacggtag ctcgctgttt gatggcggaa 9180 
atggcctatc gctgggagac taacgatacc gcatcacgct ttatcaaacc cggcgcctgg 9240 
cagggaaccc atgccggttt gctcgcgggt gaaaccttaa tgctgaatct ggcacagatg 9300 
gaagatgccc acctgaaaca ggataaacgc gtactggagg tagaacgtac cgtttcgctg 9360 
gccgaagtct atgccaaatt accgcaagat aaatttatcc tgactcagga aatagagaag 9420 
ttggtgagta aaggttcagg cagggccggc aaggacaata ataagctggc gtttagtacc 9480 
aataccaata cctctctaga agcgtccatt tcgttatcta ccttgaacat tagcagcgat 9540 
tatcctgatt ctattggtaa aacccgtcgt attaaacaga tcagcgttac cctgccagca 9600 
ctgctaggac cctatcagga tgtgcaagca attctgtctt acagcggaaa agcctctgaa 9660 
ttggctgaaa gttgcaaatc attagcggtt tctcatggga tgaatgacag cggtcagttc 9720 
caactggatt tcaaogatgg caaattcctg ccgttcgaag gaatcaaaat cgatgaaggt 9780 
acgctgacat tgagcttccc aaatgcaatt agtaaagaag acaaaaaaga cgaaaaaggc 9840 
aaflcaacaag ccatgctgga gagtctgaac gacatcattc tgcatattcg ctacaccatt 9900 
cgccaataac gattttaatt aagtgctaaa acaggcccct aagcggggcc tgcaaggagt 9960 
ctttcatgca aaattcacaa gatttcagta ttacagaact atcattgccc aaaggaggag 10020 
gcgctatcac gggaatgggg gaagctttaa cccccaccgg gccggatggg atggcogcgc 10080 
tgtctctgcc gttgcctatc tctgccgggc gcggttatgc tccgtcactc gccttaaact 10140 
acaacagcgg cgccggtaac agcccatttg gtctgggctg ggattgcaac gttatgacca 10200 
tccgccgccg cacccatttt ggcgttccac attatgatga aaccgatacc tttctggggc 10260 
cagatggcga ggtactggtg gtagcggatc aatcccgcga cgaatcgaca ttacagggta 10320 
tcaacttagg caccgccttt acxzgttaccg gataccgttc ccgtctggag agtcatttca 10380 
gccgattgga atattggcaa cccaaggcaa cacccaagac aactggcaaa acagattttt 10440 
ggctgatata tagcccagat ggacaagtac atttactggg taaatcacca caagcccgga 10500 
tcagcaaccx; gtcagacatc actcaaacag cacaatggtt gctagaagcc tctgtgtxac 10560 
cacatggtga acaaatttat tatcaatatc gggccgagga taacaccggt tgcgaagctg 10620 
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atgaaattac tctccatcca caggccgccg cgcaacgtta tctacacaca gtgtattacg 10680 
gcaaccggac agccagcaaa acgttacccg gtctggatgg cagcgcccca ccacaagcag 10740 
actggttatt ctatctggta tttgattacg gcgaacgcag taacaacctg agaaogccgc 10800 
cagcattttc gactacaggt agctggcttt gtcgccagga ccgtttttcc cgttatgaat 10860 
atggttttga gattcgtaoc cgccgcttat gccgtcaggt attgatgtat caccacctgc 10920 
aagctctgga tagcgagata aaagaacaca acggaccaac gctggtttca cgcctgatac 10980 
tcaattatga cgaaagcgca atcgccagca cgctggtatt cgttcgtcga gtaggccacg 11040 
agcaagacgg tactgccgtc accctgccgc cattagaatt ggcgtatcag gatttttcac 11100 
cgcaacataa cgctcgctgg caatcgatga atgtgctggc aaacttcaat gccattcagc 11160 
gctggcaact agttgatcta aaaggcgaag gattccccgg tctgctatat caagataaag 11220 
gcgcctggtg gtaccgctcc gcacaacgtt ttggcaaaat tggctcagat gccgtcactt 11280 
gggaaaaaat gcaacctttg tcggttatcc cttccttgca aagtaatgcc tcgctggtgg 11340 
atatcaatgg agaoggccaa cttgactggg ttatcaccgg accgggatta cggggatatc 11400 
atagtcagca tccagatggc agttggacac gttttacccc gctcaacgct ctgccagtgg 11460 
aatatactca tccacgcgcg caactcgccg atttaatggg agctggactt tctgatttag 11520 
tactgatcgg ccctaagagt gtacgtttat atgccaatac ccgcgacggc tttgccaaag 11580 
gaaaagatgt agtgcaatcc ggtgatatca cactgccagt accgggcgcc gatccgtgta 11640 
agttggtggc atttagtgat gtattgggtt ccggtcaggc acatctggtt gaagtgagcg 11700 
cgactaaagt cacctgctgg cctaatctgg ggcacggacg ttttggtcaa ccaattactc 11760 
ttccgggatt tagccaacca gaagcgacgt ttaatcctgc tcaagtttat ctggccgatc 11820 
tagatggcag cggcccgact gatctgattt atgttcacac agatcgtctg gatatcttcc 11880 
tgaataaaag cggcaacggc ttcgccgcac cagtaactct ccccttccca gccggagtgc 11940 
gttttgatca tacctgtcag ttacaagtgg ccgatgtaca agggttaggc gtcgccagcc 12000 
tgatattaag tgtgccgcat atgactcccc atcactggcg ttgcgatctg accaacacaa 12060 
aaccgtggtt actcagtgaa atgaacaaca atatgggggc tcatcacacc ctgcgttacc 12120 
gtagttccgc ccagttctgg ctggatgaaa aagccacggc actggatgcc ggacaaatac 12180 
cagtttgtta tctacccttc ccggtacaca ccctatggca aacggaaata gaggatgaaa 12240 
tcagcggcaa caaattagtc acaatactac gttatgcaca tggcgcctgg gatggacgtg 12300 
agogagaatt tcgcggattt ggttatgttg aacagaaaga cagccatcaa ctggcccaag 12360 
gcagtgcgcc agaatgcaca ccacctgcac tgacccaagg caacgcgcct gaactcacat 12420 
cacccgcgct gacccaaggc aacgctccag aactcacacc acctgcgatg acccaaagca 12480 
acgcgcctga actcacatca cccgcgctga cocaaggcaa cgcgccagaa ttcacatcac 12540 
ccgcgctggc ccaaggcaat gcgccagaac tcacaccacc tgogatgacc aaaaactggt 12600 
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atgccaccgg aatacccatg atagataaca cattatcgac agagtattgg catggtgatc 12660 
accaagcttt tgccggtttt tcaccacgct ttacgacctg gcaagatggt caagatattc 12720 
tgctcacacc ggaaaatgat aacagtcagt actggctaaa ccgggcactg aaaggtcaac 12780 
tgctacgcag tgaactgtac ggcgaggatg gcagtacaca ggaaaaaatt ccctacacag 12840 
tcactgaatt tcgcccacag gtacgtcggt tacagcatac cgatagccga taccttgtgc 12900 
tttggtcatc tgtagttgaa agccgcaact atcattacga acgtatcgcc agcgatcctc 12960 
aatgcagcca aaagattacg ctatccagcg atctatttgg tcaaccgcta aaacaggttt 13020 
cggtacagta tccacgccgc cagcaaccgg caagcagtcc gtatcctgat acgttgcctg 13080 
ataagttatt tgctaacagc tatgatgacc agcaacacaa attacggctc acctatcaac 13140 
agttcagttg gcatcatctg accgacaata ccattctgat gttaggatta ccggatagta 13200 
cccgcagcga tatctttgct tatagcgctg aacatgtccc tactggtggt ctaaatctgg 13260 
aaatcctaaa tgataaaaat agtctgattg cggagaataa acctcgtgaa tacctcggcc 13320 
agcaaaaaac cgtttatacc gacgggcaaa atgcaacgcc atcgcaaacg ccaacacgac 13380 
aagcgctgat tgccttcacc gagacaacag tatttaatca atccacacta tcagcgtttg 13440 
atgggagtat ctcatctgct caattgtcaa cgacgctgga acaagccgga taccagcaaa 13500 
cagattatct attcccgcgc actggagaag ataaagtctg ggcagctcgt cgtggctata 13560 
ctgattacgg cacagccgaa cagttctggc ggccgcaaaa acagagcaac actcaactca 13620 
cgggcaaaat cacgctcact tgggatgcaa actattgcgt cgtcacacaa acccgggatg 13680 
cggctggact gacaacctca gccagatatg attggcgttt tctgaccccc gttcaactca 13740 
cggatatcaa cgacaatcag caccttacca cgctggatgc accgggccga ccaatcacac 13800 
tgcgcttttg gggaaccgaa aacggtaaga tgactggtta ttcttcaccg gaaaaaatat 13860 
cgttttctcc accatctgat gttgacgccg cgattaagtt aacaacgcca atccctgtag 13920 
cacagtgtca ggtctacgca cccgaaagct ggatgcccat attaaagaaa accctcaata 13980 
acctggcaga gcaagagcgg aaagagttat ataacacccg aatcatcacc gaagacggac 14040 
gcatctgtac cctagctcac cgccgctggg taaaaagcca aagtgcagtc acccagccaa 14100 
tcaatctgtc aaacggcagt ccccgtttac cccctcatag cctcacattg actacggatc 14160 
gttatgaccg cgatcttaag caacagattc gtcaacaagt agtattcagt gatggctttg 14220 
gccgtttact gcaagcatct gtacgacatg aagcaggcga agcctggcaa cgtaaccaag 14280 
acggcgctct ggtgacaaaa atggaagata ccaaaacgcg ctgggcggtt acgggacgca 14340 
ctgaatatga caataaggga caaccgatac gcacctatca accctatttc ctcaacgact 14400 
ggcaatacgt cagtaatgac agtgcccggc ggacagaaga agcctatgca gatacccatg 14460 
tctatgatxc cattggtcga gaaatcaagg tcactacogc aaaaggctgg ttccgtcgaa 14520 



WO 99/42589 PCT/EP99/01 015 

-30- 



ccttgttcac tccttggttt actgtcaatg aagatgaaaa tgacacagct actgaggtga 14580 

aggtaaagaa gaaagaatgt aaagaaggta aagaaggtaa agatgtaatt tgatcaatcc 14640 

cgcccggttg aagggcggga aacataacat aatatagagg tgaaacgtgt cattcataat 14700 

gccgtcagat actcaactta tgagttggtt gatcattggt tttattgcgg cctggggcgg 14760 

attagtaagg tacctcattg atatacaaaa caaacaatgt aaatggaatt ggatcaacgt 14820 

actctgtcaa ctcattatct cctgttttac cggtatattg ggaggactgc tgagttttga 14880 

aagcggoggc agcccctata tgacttttgc gattgccggg ctatttggca ccacgggaag 14940 

ttctggattg aactggatct ggcgtcgcct ttttatgcat tatcgcgatg atggaggaaa 15000 

gcaataaggc attcccactg ccgcaaaaac catctgtctc cggcagttaa acogggaaat 15060 

tacctactac aactattgta agaaaacgaa tatatagaaa aactaacatg cagataaaaa 15120 

ctgcgattgc agaacagatg acacacaacg ccccaacaac gaggtaaatc atg aaa 15176 

Met Lys 
1 

aac ate gat cct aaa ctt tat caa aag acc cct gtc gtc aac ate tac 15224 
Asn lie Asp Pro Lys Leu Tyr Gin Lys Thr Pro Val Val Asn He Tyr 
5 10 15 

gat aac cga ggt eta acg ate cgt aac ate gac ttt cac cgt acc acc 15272 
Asp Asn Arg Gly Leu Ihr He Arg Asn He Asp Phe His Arg Ihr Thr 
20 25 30 

gca aac ggc gat acc gat ate cgt att act cgc cat caa tat gac tec 15320 
Ala Asn Gly Asp Ihr Asp lie Arg He Ihr Arg His Gin Tyr Asp Ser 
35 40 45 50 

ctt ggg cac eta age caa age acc gat ccg cgt eta tat gaa gee aaa 15368 
Leu Gly His Leu Ser Gin Ser Ihr Asp Pro Arg Leu Tyr Glu Ala Lys 
55 60 65 

caa aaa tct aac ttt etc tgg cag tat gat ttg acc ggt aat att ttg 15416 
Gin Lys Ser Asn Phe Leu Trp Gin Tyr Asp Leu Ihr Gly Asn He Leu 
70 75 80 

tgt aca gaa age gtc gat get ggt cgc act gtc acc ttg aat gat att 15464 
Cys Ihr Glu Ser Val Asp Ala Gly Arg Ihr Val Thr Leu Asn Asp He 
85 90 95 

gaa ggc cgt ccg eta ctg aca gta act gca aca ggt gtc ata caa acc 15512 
Glu Gly Arg Pro Leu Leu Thr Val Ihr Ala Thr Gly Val He Gin Thr 
100 105 110 

cga caa tat gaa acg tct tec eta ccc ggt cgt ctg ttg tct gtt acc 15560 
Arg Gin Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser Val Thr 
115 120 125 130 

gaa caa ata cca gaa aaa aca tec cgt ate acc gaa cgc ctg att tgg 15608 
Glu Gin He Pro Glu Lys Thr Ser Arg He Thr Glu Arg Leu He Trp 
135 140 145 

get ggc aat age gaa gca gag aaa aac cat aat ctt gee age cag tgc 15656 
Ala Gly Asn Ser Glu Ala Glu Lys Asn His Asn Leu Ala Ser Gin Cys 
150 155 160 

gtg cgc cac tat gac acg gcg gga gtc acc cga tta gag agt ttg tea 15704 
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Val Arg His Tyr Asp Thr Ala Gly Val Thr Arg Leu Glu Ser Leu Ser 
165 170 175 

ctg acc ggt act gtt tta tct caa tec age caa eta ttg age gac act 15752 
Leu Thr Gly Thr Val Leu Ser Gin Ser Ser Gin Leu Leu Ser Asp Thr 
180 185 190 

caa gaa get age tgg aca ggt gat aat gaa acc gtc tgg caa aac atg 15800 
Gin Glu Ala Ser Trp Thr Gly Asp Asn Glu Thr Val Trp Gin Asn Met 
195 200 205 * 210 

ctg get gat gac ate tac aca acc ctg age gee ttt gat gee acc ggc 15848 
Leu Ala Asp Asp He Tyr Thr Thr Leu Ser Ala Phe Asp Ala Thr Gly 
215 220 225 

get tta etc act cag acc gat gcg aaa ggg aac att cag agg eta acc 15896 
Ala Leu Leu Thr Gin Thr Asp Ala Lys Gly Asn lie Gin Arg Leu Thr 
230 235 240 

tat gat gtg gee ggg cag eta aac ggg age tgg tta acc tta aaa gac 15944 
Tyr Asp Val Ala Gly Gin Leu Asn Gly Ser Trp Leu Thr Leu Lys Asp 
245 250 255 

caa ccg gaa caa gtg att ate aga tec ctg acc tat tec gee gee gga 15992 
Gin Pro Glu Gin Val He He Arg Ser Leu Thr Tyr Ser Ala Ala Gly 
260 265 270 

caa aaa tta cgc gag gaa cac ggc aat ggt gtt ate acc gaa tac agt 16040 
Gin Lys Leu Arg Glu Glu His Gly Asn Gly Val He Thr Glu Tyr Ser 
275 280 285 290 

tat gaa ccg gaa acc caa cag ctt ate ggt acc aaa acc cac cgt ccg 16088 
Tyr Glu Pro Glu Thr Gin Gin Leu He Gly Thr Lys Thr His Arg Pro 
295 300 305 

tea gat gee aaa gtg ttg caa gat eta cgt tat gag tat gac ccg gta 16136 
Ser Asp Ala Lys Val Leu Gin Asp Leu Arg Tyr Glu Tyr Asp Pro Val 
310 315 320 

ggc aat gtc ate agt ate cgt aat gac gca gaa gee acc cgc ttc tgg 16184 
Gly Asn Val He Ser He Arg Asn Asp Ala Glu Ala Thr Arg Phe Trp 
325 330 335 

cac aat cag aaa gtg gcg ccg gaa aac act tat acc tac gac tec ttg 16232 
His Asn Gin Lys Val Ala Pro Glu Asn Thr Tyr Thr Tyr Asp Ser Leu 
340 345 350 

tat cag ctt ate age gca acc ggg cgc gag atg gcg aat ata ggt cag 16280 
Tyr Gin Leu He Ser Ala Thr Gly Arg Glu Met Ala Asn He Gly Gin 
355 360 365 370 

caa agt aac caa ctt ccc tec etc acc eta cct tct gat aac aac acc 16328 
GlnSerAsnGlnLeuProSerLeuThrl^ProSerAspAsnAsn^ 
375 380 385 

tac acc aac tat acc cgt act tat act tat gac cgt ggc ggc aat ttg 16376 
Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly Gly Asn Leu 
390 395 400 

act aaa ate cag cac agt tea ccg gcg acg caa aac aac tac acc aca 16424 
Thr Lys He Gin His Ser Ser Pro Ala Thr Gin Asn Asn Tyr Thr Thr 
405 410 415 

aac ate acg gtt tct aac egg age aat cgc gca gta etc age act ctg 16472 
Asn He Thr Val Ser Asn Arg Ser Asn Arg Ala Val Leu Ser Thr Leu 
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420 425 430 

acc gaa gat ccg gcg caa gta gat get tta ttt gat gca ggc gga cat 
Thr Glu Asp Pro Ala Gin Val Asp Ala Leu Phe Asp Ala Gly Gly His 
435 440 445 450 

cag aac acg ttg ata tea gga caa aac ctg aac tgg aat aca cgc ggt 
Gin Asn Thr Leu lie Ser Gly Gin Asn Leu Asn Trp Asn Thr Arg Gly 
455 460 465 

gaa eta caa cat gtg aca ttg gtg aaa egg gac aag ggc gee aat gat 
Glu Leu Gin His Val Thr Leu Val Lys Arg Asp Lys Gly Ala Asn Asp 
470 475 480 



tat ttg ccg age tta gaa ctt cgt eta aca caa aac age acg ate aca 
Tyr Leu Pro Ser Leu Glu Leu Arg Leu Thr Gin Asn Ser Thr lie Thr 
515 520 525 530 



16520 



16568 



16616 



gat egg gaa tgg tat cgc tat agt agt gac ggg aga agg ata tta aaa 16664 
Asp Arg Glu Trp Tyr Arg Tyr Ser Ser Asp Gly Arg Arg He Leu Lys 
485 490 495 

ate aat cag cag acc age age aac tct caa aca cag aga ata act 16712 
He Asn Glu Gin Gin Thr Ser Ser Asn Ser Gin Thr Gin Arg He Thr 
500 505 510 



16760 



acc gaa gat ttg caa gtt ate aca gta gga gaa gcg ggt egg gca cag 16808 
Thr Glu Asp Leu Gin Val He Thr Val Gly Glu Ala Gly Arg Ala Gin 
535 540 545 

gta cga gta tta cat tgg gat age ggt caa ccg gaa gat ate gac aat 16856 
val Arg Val Leu His Trp Asp Ser Gly Gin Pro Glu Asp He Asp Asn 
550 555 560 

aat cag eta cgt tat age tac gat aat ctt ate ggt tec agt caa ctt 16904 
Asn Gin Leu Arg Tyr Ser Tyr Asp Asn Leu He Gly Ser Ser Gin Leu 
565 570 575 

gaa tta gac age aaa gga gaa att att agt gag gaa gag tac tat ccc 16952 
Glu Leu Asp Ser Lys Gly Glu He He Ser Glu Glu Glu Tyr Tyr Pro 
580 585 590 

tat ggc ggc acg gca tta tgg gca aca agg aag egg aca gaa gee agt 17000 
Tyr Gly Gly Thr Ala Leu Trp Ala Thr Arg Lys Arg Thr Glu Ala Ser 
595 600 '605 610 

tat aaa acc ate cgt tat tea ggt aaa gag egg gat gee acc gga eta 17048 
Tyr Lys Thr He Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu 
615 620 625 

tat tat tac ggt tac cga tat tat cag cct tgg gta gga cga tgg tta 17096 
Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gin Pro Trp Val Gly Arg Trp Leu 
630 635 640 

agt gee gat ccg gca gga aca gta gat ggg ttg aat tta tat egg atg 17144 
Ser Ala Asp Pro Ala Gly Thr Val Asp Gly Leu Asn Leu Tyr Arg Met 
645 650 655 

gta agg aat aat ccg gtt act ctg ctt gat cct gat gga tta atg cca 17192 
Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro Asp Gly Leu Met Pro 
660 665 670 

aca att gca gaa cgc ata gca gca ctg caa aaa aat aaa gta gca gat 17240 
Thr He Ala Glu Arg He Ala Ala Leu Gin Lys Asn Lys Val Ala Asp 
675 680 685 690 
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tca gcg cct teg cca aca aat gec aca aac gta gcg ata aac ate cgc 17288 
Ser Ala Pro Ser Pro Thr Asn Ala Thr Asn Val Ala lie Asn He Arg 
695 700 705 

ccg ccc gta gca cca aaa cct acc tta ccc aaa gca tea acg agt age 17336 
Pro Pro Val Ala Pro Lys Pro Thr Leu Pro Lys Ala Ser Thr Ser Ser 
710 715 720 

caa tea act aca tac ccc ate aaa tct gca age ata aaa cca acg acg 17384 
Gin Ser Thr Thr Tyr Pro He Lys Ser Ala Ser He Lys Pro Thr Thr 
725 730 735 

teg gga tea tec att act get cca ctg agt cca gta gga aat aaa tct 17432 
Ser Gly Ser Ser He Ihr Ala Pro Leu Ser Pro Val Gly Asn Lys Ser 
740 745 750 

act cct gaa ata tct ctt cca gaa age act caa age aat tct tea age 17480 
Thr Pro Glu He Ser Leu Pro Glu Ser Thr Gin Ser Asn Ser Ser Ser 
755 760 765 770 

get att tea aca aat eta cag aaa aag tea ttt act tta tat aga gcg 17528 
Ala He Ser Thr Asn Leu Gin Lys Lys Ser Phe Thr Leu Tyr Arg Ala 
775 780 785 

gat aat aga tec ttt gaa gac atg cag agt aaa ttc cct gaa gga ttt 17576 
Asp Asn Arg Ser Phe Glu Asp Met Gin Ser Lys Phe Pro Glu Gly Phe 
790 795 800 

aaa gee tgg act cct eta gat act aag atg gca agg cag ttt get agt 17624 
Lys Ala Trp Thr Pro Leu Asp Thr Lys Met Ala Arg Gin Phe Ala Ser 
805 810 815 

gtc ttt att ggt cag aaa gat act tct aat tta cct aaa gaa aca gtc 17672 
Val Phe He Gly Gin Lys Asp Thr Ser Asn Leu Pro Lys Glu Thr Val 
820 825 830 

aag aat ata aac aca tgg gga aca aaa cca aaa tta aat gat etc tea 17720 
Lys Asn He Asn Thr Trp Gly Thr Lys Pro Lys Leu Asn Asp Leu Ser 
835 840 845 850 

act tac ata aaa tat acc aag gac aaa tct aca gta tgg gtc tct act 17768 
Thr Tyr He Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val Ser Thr 
855 860 865 

gca att aat act gaa gca ggt gga caa agt tea ggg get cca etc cat 17816 
Ala He Asn Thr Glu Ala Gly Gly Gin Ser Ser Gly Ala Pro Leu His 
870 875 880 

gaa att aat atg gat ctt tat gag ttt acc att gac gga caa aag eta 17864 
Glu He Asn Met Asp Leu Tyr Glu Phe Thr He Asp Gly Gin Lys Leu 
885 890 895 

aat cca eta cca agg. gga aga tct aaa gac agg gtg cct tea eta tta 17912 
Asn Pro Leu Pro Arg Gly Arg Ser Lys Asp Arg Val Pro Ser Leu Leu 
900 905 910 

ctt gac aca cca gaa ata gaa aca gca tec ata att gca ctt aat cat 17960 
Leu Asp Thr Pro Glu He Glu Thr Ala Ser He He Ala Leu Asn His 
915 920 925 930 

gga ccg gta aat gat gca gaa gtt tea ttc eta aca aca att ccg ctt 18008 
Gly Pro Val Asn Asp Ala Glu Val Ser Phe Leu Thr Thr He Pro Leu 
935 940 945 
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aaa aat gta aaa cct tat aag aga taa cgaaaaatta atattcttta 18055 
Lys Asn Val Lys Pro Tyr Lys Arg 

950 955 

tctactttta atagccctct tgaacttaca ctcaaggggg ggaaaccaaa taagaaacca 18115 

tctttaataa caagccatga aagaatattt atttcatggc ttgattactt ttaacattca 18175 

atattaaata at taaaacaa tatctaacca attaaaataa caatacctta tttatcatat 18235 

taaaatatca aatcagaaat taatgaattt aagggttctt tatatttatt tctgagagca 18295 

taggcacaat accttaccga tggcgctgga cgtgattcaa aatccagaaa tgctatattt 18355 

tcatcaatat gggcagaata gcgcatttca ttgggagtca ttaaacttat cgcgacaccc 18415 

gcttttacca gatccaatct. attagtaaaa tcagggaccg tcaataacgc taaattttgg 18475 

tattcaggga gataattcaa tggcataaaa ttattgcatt gttttaaaaa agcactatta 18535 

tgctgaacaa aagg aaaac t agatattatt tcatcagcgt gactttctgg ttctaaaata 18595 

tcatgggata cagcaagaga cagcatttga taagcaccat ctatoctgat gatatcatca 18655 

ttatctggat aacattcagt cgtcacataa actgttatat cccctttcat taaggaagaa 18715 

aataccgcat cttgccttat taaatcatca attagaaaat tgttgattat acaaatatcg 18775 

cgataatgat aacgttgcac ogctcttttt acgaccgtag atattttatt aacatattct 18835 

ccacttgtgc caataaccag tttgtctctg tttgataatt tataatttct acgacaattc 18895 

caattattct caactttcag gatcctttca taacacggca gcaactcttg atatagtgcc 18955 

tttccctctt ctgtgagctt ggtttttccc ggtagtcgct caaacaattg acaccccaca 19015 

ogctgttcca gttgatatac gagcctgcta agtggagaag gggtaataca aagogtatcc 19075 

gccgctaacg tgaatgactc tttcttagct gattccataa aatactttag ttgctttgaa 19135 

caaaatatca tcacataccc tcttgttttc attccagaaa tagaatatta accatagaac 19195 

atgacaacga tgtttctact ttgcattctt ttacattagg acatgcgtta atggacattg 19255 

aatttcacta catcaattgt taatatttat ttaatacttg cacaataatt ataaaataaa 19315 

tataacttag ttaattattt cttgatattg atcatggtaa gttttcctca ataoctacag 19375 

aagtagatat tattttatct tccagtaatc tatcgtttgg cgacggaggt cgattcttcc 19435 

attgggatat tcaacccatt cgccgccttt cttattaatt acagtgattt ttggcatttt 19495 

ggtttcatcc aacttaggtt tataggtgat tttccattta gcacccggtg ttaacttcaa 19555 

cctaaaggga tacataccaa cttcaccttg taagaatatt ctgtttggtc taccttcaac 19615 

gactttcaaa atggggtaaa taaccgggct aaaatcaatc gtatccaatg catcaatttc 19675 

gctgatattt gtccgggctg catcattgat aaatgcgatt aaatcggttg ctgaatacgg 19735 

aatagcatct ttcactagat gacggacatc ggtataactc actgacacaa aggctcggtc 19795 

aatcttccac ttacatcgac cgccaccatt aaaaggtagt tttgcctgaa agtaaccggt 19855 

tttcggatca gcttttacat ccagacgtaa tccgttataa gttggtacct taaaaggcga 19915 
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catattggaa 


tctaaacgat 


atttaaggca 


atcttttgag 


atatacacag 


cggatacatg 19975 


cggctgtgtg 


tatttaggtg 


cgactccttc 


tacagtaatc 


cactgattct 


ctttgggagg 20035 


agagagcggc 


tcatttgggt 


cagcacagcc 


tgatattaaa 


atcacggata 


agacagataa 20095 


gtatttcttg 


atatttatca 


tggtaagttt 


tcctcaactc 


ctacagcgtt 


atctgcatgt 20155 


gtgtccaatt 


ccagatcttc 


ctgtttatct 


atttagaaat 


aaataagcta 


cgctgatagc 20215 


attacttcat 


atttccatac 


atgaatcgaa 


aatcgacttc 


ttgagtgccg 


ttatcaattt 20275 


tgccgcccgg 


atattcaacc 


cactcgccgc 


ctttcttatt 


agtcaccgtg 


accttcgcca 20335 


ttttggtttc 


atccagctta 


ggcttaaaaa 


taattttcca 


tttagctcct 


ggagttaacg 20395 


tgagttgaaa 


aggacgcatt 


tttaatactt 


caccttgtaa 


gaatattctg 


ttcgggcgac 20455 


cttcaacgac 


ttfccaaaaca 


gggtaaataa 


ccgggctaaa 


atcaatcgta 


ttcaatgtcg 20515 


agattttgct 


aatattcatc 


tggactatgc 


cattgataga 


tgcgattaaa 


ccggttgctg 20575 


aatacggaat 


agcatctttc 


accagatggc 


tgacatcagt 


ataactcacc 


gatacaaagg 20635 


cccggttaat 


tttccattta 


catcgtcccc 


ctccattaaa 


aggtagtttt 


gcttgaaaat 20695 


aaccggtttg 


tggatcggcc 


ttcactttca 


gacgaagccc 


attataggtc 


ggcactttaa 20755 


aaggcgacat 


attggaatcc 


agacgatact 


caaggcaatc 


ctttgatatg 


tattctgcgg 20815 


atacatgtgg 


ttcggtatat 


ttcggcgcta 


ccccttctac 


cgtgatccat 


tgattttctt 20875 


taggagggga 


aagcggctca 


tttgggtcag 


cacagcctga 


tattaaaatc 


actgacaaga 20935 


caaataagta 


ttttttaaca 


tttatcatgg 


taagttttcc 


tcaattccta 


cagcattatc 20995 


cgcataaata 


tcctgtcaag 


aatagcgttc 


attgatttcg 


tcaccaaaga 


aacaagatag 21055 


tv*aaaatcct 


attaccacag 


ataaaaaaca 


ccgcttatgc 


cgtgagtaat 


agtgagttga 21115 


gcgacaggga 


tacagcagtg 


catccccatc 


aattagtccc 


tttgaataaa 


gggaacagaa 21175 


tttgaaattt 


ccgtcatacc 


gtccatatta 


oggaacttag 


attatgatta 


ttaaatcacc 21235 


accaaatggc 


aagaaaaatt 


ttcatttttt 


aatttacgaa 


gaatgaattt 


gtaagaaagt 21295 


gttacaaact 


taatagaaat 


taatttactg 


ttaatctaat 


gaaggatgaa 


attataaaaa 21355 


taacccattt 


ctcagggaca 


acaatccaca 


atatatagaa 


ccactggtcc 


tcacttaatt 21415 


tcctgtcagg 


agtagaaata 


tcctgatgac 


tcagtcgatg 


acatacagca 


atgtcattgg 21475 


tattgagact 


acogactgtt 


taataaattt 


cttttgtctt 


taatggcgag 


atacaagtga 21535 


ttcactattt 


aagcactatc 


gataaataag 


attccaaaat 


agcgccatat 


cttacaccac 21595 


tcataattct 


atgtataaca 


attggttaaa 


taggatcatg 


tgtaacagga 


ttatgaaacg 21655 


ttatttatat 


caaatctatc 


aattatttta 


tatatagttt 


cacagtcaca 


ctcgctatct 21715 


ggtaccttca 


taaccaactg 


ccctccctgc 


gctaccttct 


gataacaaca 


gctacactaa 21775 


ctatacccgc 


gcctataatt 


atgacogtgt 


gaaaattcag 


cgtagttcac 


cggccacgca 21835 
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aaataactac acgaaatatt gctccccaga gaaacaccgt tcgaggttgt ttcaatgaaa 21895 
catcaaggta gagacaccta tgtattatta caagatatta aaccctctgc gattactcat 21955 
aggaatgtac gtaatactta tacaggcaac ttcacgtcat ccagagaaaa ttaagttgta 22015 
ca aaatagac atcaactaat atagtaatag aaaatcccct gaaaatagat tcaggggatt 22075 
ta a taaa tta aocaaaaatc ataataaaaa tttatttcat tattttagga taaatattta 22135 
attagcctaa taatgaatta ttacttaaag taatxcctaa acaatcaaat cggaaattaa 22195 
taaattcaat ggttcttgat atttatgcct gagagcataa gcacaatatt tcactgaggg 22255 
tgtcggatgc gatttaaaat tcaaaaaggt aatgccttta tgaaggtcag cagaacagag 22315 
cacttcattc ggtgtcatta aacttatcgc gatacctgat ttcactaagt ccaaccgatt 22375 
agtaaaatca ggcacagtta gcaaagttaa attctggtat tcaggtagac aattcagtga 22435 
aataaaattg ccgcactgtt taaagaaagc actattatgc tgagccaaag gcaatgtata 22495 
tataatatta tcagcgttac tttccgatcc taaaatatca ttagctacag caaggcgcaa 22555 
agtctgataa gttccatcca ctctgataat atcatcgttg tcaggataat gttcagtcgt 22615 
cacatagacg gttatatctt ccttcattaa agaagaaaat attgtctctt ttcttgccga 22675 
atcatcaacc agaaaattgt tttttatgta aatatcacga taatgataac gttgtaccgc 22735 
tctttttatc actgccgaaa ttttattaat atattctcca cttgtcccga tgaccagttt 22795 
gccggtgttt gctaattttc cgcttctacg ataatgccaa ttatcctcaa ctcgctgaat 22855 
tctctcataa cacggcaata gctcctgata taatgccttc ccctcttcag tgagtttggt 22915 
ccttcccggt agtcgttcaa atagttgaca gcccacacgt tgttccagtt gatatatgat 22975 
cctacttaga ggagagggag taatacaaag cgtatccgcg gctaaagtaa atgactcttt 23035 
tttcgctgac tccataaaat atttcaagct ctttgaacaa aatagcatca tatatccttc 23095 
ttattttaat tcattgttcc atccgaaata gaatggaatg ttaacaagaa aacattacaa 23155 
ctacttttct tctttgcatt atttaacatc aaagtatgca ttaactgaga ttgagtttta 23215 
tcatctttat tcttaacagt tatcaaacaa ttttcattat tattgcaaaa taaatacaac 23275 
cccttcLtat gttacaataa tgattataaa gaaatttcac atattatcat taagtaataa 23335 
tgggcacaat taaccattta attaaacatt tcaattggtt gacaaagact cattatgttc 23395 
aacatgtaat gagcgcaatt ttaacattaa ataaattaca tagttcatat tcattatcac 23455 
tgagatcagc ttttttcgta cagtacatca tgtgaacaat accgtgccat ttcctgccaa 23515 
atcttattaa aaagtcagtt gcaaattttg catctgcttt ttttgcaaca gctatttaaa 23575 
gaaaacagtg agatagtgat tatccgagag atcaagatat gtctgctctt tacgcacaaa 23635 
ctgcaaacca tttctatgca- tatctcagct atttctcaaa acctgtattt aatcatctct 23695 
tattccgatg gaacggaatc attctctgat tgattcatga tgtaaagaca atatggatgt 23755 
ttcatttact tt atg att tta aaa gga ata aat atg aat teg cct gta aaa 23806 
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Met He Leu Lys Gly lie Asn Met Asn Ser Pro Val Lys 
960 965 

gag ata cct gat gta tta aaa ate cag tgt ggt ttt cag tgt ctg aca 23854 
Glu He Pro Asp Val Leu Lys He Gin Cys Gly Phe Gin Cys Leu Thr 
970 975 980 

gat att age cac age tct ttt aac gaa ttt cac cag caa gta tec gaa 23902 
Asp He Ser His Ser Ser Phe Asn Glu Phe His Gin Gin Val Ser Glu 
985 990 995 1000 

cac etc tec tgg tec gaa gca cac gac tta tat cat gat gca caa cag 23950 
His Leu Ser Trp Ser Glu Ala His Asp Leu Tyr His Asp Ala Gin Gin 
1005 1010 ~ 1015 

gee caa aag gat aat egg ctg tat gaa gcg cgt att ctt aaa cgc acg 23998 
Ala Gin Lys Asp Asn Arg Leu Tyr Glu Ala Arg He Leu Lys Arg Ihr 
1020 1025 1030 

aat cct caa tta caa aat get gta cat ctt gee ate gta gcg cct aat 24046 
Asn Pro Gin Leu Gin Asn Ala Val His Leu Ala He Val Ala Pro Asn 
1035 1040 1045 

get gaa ctg ata ggc tat aac aac caa ttt age ggc agg gee agt caa 24094 
Ala Glu I^su He Gly Tyr Asn Asn Gin Phe Ser Gly Arg Ala Ser Gin 
1050 1055 1060 

tat gtc gcg ccg ggt acc gtt tec tec atg ttc tec ccc gee get tat 24142 
Tyr Val Ala Pro Gly Ihr Val Ser Ser Met Phe Ser Pro Ala Ala Tyr 
1065 1070 1075 1080 

ttg act gag ctt tat cgt gaa gca cgc aat tta cac gec age gat tec 24190 
Leu Ihr Glu Leu Tyr Arg Glu Ala Arg Asn Leu His Ala Ser Asp Ser 
1085 1090 1095 

gtt tat cgc ctg gat act cgc cgc cca gat etc aaa tea atg gcg etc 24238 
Val Tyr Arg Leu Asp Ihr Arg Arg Pro Asp Leu Lys Ser Met Ala Leu 
1100 1105 ~ 1110 

agt caa caa aat atg gat acg gaa ctt tec act etc tct tta tec aat 24286 
Ser Gin Gin Asn Met Asp Thr Glu Leu Ser Ihr Leu Ser Leu Ser Asn 
1115 1120 H25 

gag eta tta ttg gaa age att aaa act gag tct aag ctg gat aat tat 24334 
Glu leu Leu Leu Glu Ser He Lys Ihr Glu Ser Lys Leu Asp Asn Tyr 
1130 1135 1140 

act caa gtg atg gaa atg etc tec get ttc cgt cct tec ggc gcg acg 24382 
Ihr Gin Val Met Glu Met Leu Ser Ala Phe Arg Pro Ser Gly Ala Ihr 
1145 1150 1155 ' H60 

cct tat cac gat get tac gaa aat gtg cgt aaa gtt ate cag eta caa 24430 
Pro iyr His Asp Ala Tyr Glu Asn Val Arg Lys Val He Gin Leu Gin 
1165 H70 H75 

gat cct ggg ctt gag caa tta aat get tea cca gee att gee ggg ctg 24478 
Asp Pro Gly Leu Glu Gin Leu Asn Ala Ser Pro Ala He Ala Gly Leu 
1180 H85 1190 

atg cat caa get tec eta tta ggt att aac get tea ate tea cct gag 24526 
Met His Gin Ala Ser Leu Leu Gly He Asn Ala Ser He Ser Pro Glu 
1195 1200 1205 

ttg ttt aat att ctg acg gag gag att act gaa ggt aat get gag gaa 24574 
Leu Phe Asn He Leu Thr Glu Glu He Thr Glu Gly Asn Ala Glu Glu 
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1210 1215 1220 

ctt tat aag aaa aat ttt ggt aat ate gaa ccg get tea ctg get atg 24622 
Leu Tyr Lys Lys Asn Phe Gly Asn lie Glu Pro Ala Ser Leu Ala Met 
1225 1230 1235 1240 

ccg gaa tac ctt aga cgt tat tac aat tta agt gat gaa gaa etc age 24670 
Pro Glu Tyr Leu Arg Arg Tyr Tyr Asn Leu Ser Asp Glu Glu Leu Ser 
1245 1250 1255 

cag ttt att ggt aaa gee age aat ttc ggc caa caa gaa tat agt aat 24718 
Gin Phe He Gly Lys Ala Ser Asn Phe Gly Gin Gin Glu Tyr Ser Asn 
1260 1265 1270 

aac caa etc att act ccg ata gtc aac age aat gat ggc aca gtc aag 24766 
Asn Gin Leu He Thr Pro He Val Asn Ser Asn Asp Gly Thr Val Lys 
1275 1280 1285 

gta tat cga att ace cgc gaa tat aca aca aat gee aat caa gta gac 24814 
Val Tyr Arg He Thr Arg Glu Tyr Tnr Tnr Asn Ala Asn Gin Val Asp 
1290 1295 1300 

gtg gag ctg ttt ccc tac ggt gga gaa aat tat cag tta aat tac aaa 24862 
Val Glu Leu Phe Pro Tyr Gly Gly Glu Asn Tyr Gin Leu Asn Tyr Lys 
1305 1310 1315 1320 

ttc aaa gat tct cgt cag gat gtc tec tat tta tec ate aaa tta aat 24910 
Phe Lys Asp Ser Arg Gin Asp Val Ser Tyr Leu Ser He Lys Leu Asn 
1325 1330 1335 

gac aaa aga gaa ctt ate cga att gaa gga gcg cct cag gtc aac ate 24958 
Asp Lys Arg Glu Leu He Arg He Glu Gly Ala Pro Gin Val Asn He 
1340 1345 1350 

gaa tat tea gaa cat ate aca tta agt aca act gat ate agt caa cct 25006 
Glu Tyr Ser Glu His He Thr Leu Ser Tnr Thr Asp He Ser Gin Pro 
1355 1360 1365 



ttt gaa ate ggc eta aca cga gta tat cct tct agt tct tgg gca tat 
Phe Glu He Gly Leu Thr Arg Val Tyr Pro Ser Ser Ser Trp Ala Tyr 
1370 1375 1380 



25054 



gca gee gca aaa ttt acc att gag gaa tat aac caa tac tct ttc ctg 25102 
Ala Ala Ala Lys Phe Thr He Glu Glu Tyr Asn Gin Tyr Ser Phe Leu 
1385 1390 1395 1400 

tta aaa etc aat aaa get att cgt eta tct cgt gcg aca gaa tta tea 25150 
Leu Lys Leu Asn Lys Ala He Arg Leu Ser Arg Ala Thr Glu Leu Ser 
1405 1410 1415 

ccc acc att ctg gaa agt att gtg cgt agt gtt aat cag caa ctg gat 25198 
Pro Thr He Leu Glu Ser He Val Arg Ser Val Asn Gin Gin Leu Asp 
1420 1425 1430 

ate aac gca gaa gta tta ggt aaa gtt ttt ctg act aaa tat tat atg 25246 
He Asn Ala Glu Val Leu Gly Lys Val Phe Leu Thr Lys Tyr Tyr Met 
1435 1440 1445 

caa cgt tat get att aat get gaa act gee eta ata eta tgc aat gca 25294 
Gin Arg Tyr Ala He Asn Ala Glu Tnr Ala Leu He Leu Cys Asn Ala 
1450 "* 1455 1460 

ctt att tea caa cgt tea tat gat aat caa cct age caa ttt gat cgc 25342 
Leu He Ser Gin Arg Ser Tyr Asp Asn Gin Pro Ser Gin Phe Asp Arg 
1465 1470 1475 1480 
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ctg ttt aat acg cca tta ctg aac ggc caa tat ttt tct acc gga gat 25390 
Leu Phe Asn Thr Pro Leu Leu Asn Gly Gin Tyr Phe Ser Thr Gly Asp 
1485 1490 1495 

gaa gag att gat tta aat cca ggt agt act ggc gat tgg cgt aaa tec 25438 
Glu Glu He Asp Leu Asn Pro Gly Ser Thr Gly Asp Trp Arg Lys Ser 
1500 1505 1510 

gtg ctt aaa cgt gca ttt aat ate gat gat att tec etc tac cgc ctg 25486 
Val Leu Lys Arg Ala Fhe Asn He Asp Asp He Ser Leu Tyr Arg Leu 
1515 1520 1525 

ctt aaa att acc aac cat aat aat caa gat gga aag att aaa aat aac 25534 
Leu Lys He Thr Asn His Asn Asn Gin Asp Gly Lys He Lys Asn Asn 
1530 1535 1540 

tta aat aat ctt tct gat tta tat att ggg aaa tta ctg gca gaa att 25582 
Leu Asn Asn Leu Ser Asp Leu Tyr He Gly Lys Leu Leu Ala Glu He 
1545 1550 1555 1560 

cat caa tta acc att gat gaa ttg gat tta ttg ctg gtt gee gtg ggt 25630 
His Gin Leu Thr He Asp Glu Leu Asp Leu Leu Leu Val Ala Val Gly 
1565 1570 1575 

gaa gga gaa act aat tta tec get ate agt gat aaa caa ctg gcg gca 25678 
Glu Gly Glu Thr Asn Leu Ser Ala He Ser Asp Lys Gin Leu Ala Ala 
1580 1585 1590 

ctg ate aga aaa etc aat acc att acc gtc tgg eta cag aca cag aag 25726 
Leu He Arg Lys Leu Asn Thr He Thr Val Trp Leu Gin Thr Gin Lys 
1595 1600 1605 

tgg agt gcg ttc caa tta ttt gtt atg act tec acc age tat aac aaa 25774 
Trp Ser Ala Phe Gin Leu Phe Val Met Ihr Ser Ihr Ser Tyr Asn Lys 
1610 1615 1620 

acg ctg acg cct gaa att aag aat ctg ctg gat acc gtc tac cac ggt 25822 
Ihr Leu Thr Pro Glu He Lys Asn Leu Leu Asp Ihr Val Tyr His Gly 
1625 1630 1635 1640 

tta caa ggc ttt gat aaa gac aag gca aat tta ctg cat gtt atg gcg 25870 
Leu Gin Gly Phe Asp Lys Asp Lys Ala Asn Leu Leu His Val Met Ala 
1645 1650 1655 

ccc tat att gcg gee acc tta caa tta tea teg gaa aat gtc gee cat 25918 
Pro Tyr He Ala Ala Thr Leu Gin Leu Ser Ser Glu Asn Val Ala His 
1660 1665 1670 

tct gtg ctg ctt tgg gca gac aag tta aag ccc ggc gac ggc gca atg 25966 
Ser Val Leu Leu Trp Ala Asp Lys Leu Lys Pro Gly Asp Gly Ala Met 
1675 1680 1685 

aca gee gaa aaa ttc tgg gac tgg ttg aat act caa tat acg cca gat 26014 
Thr Ala Glu Lys Phe Trp Asp Trp Leu Asn Thr Gin Tyr Thr Pro Asp 
1690 1695 1700 

tea teg gaa gta tta gca aca cag gaa cat att gtt cag tat tgt cag 26062 
Ser Ser Glu Val Leu Ala Thr Gin Glu His He Val Gin Tyr Cys Gin 
1705 1710 1715 -1720 

gcg ttg gcg caa tta gaa atg gtt tac cat tec acc ggt ate aat gaa 26110 
Ala Leu Ala Gin Leu Glu Met Val Tyr His Ser Thr Gly He Asn Glu 
1725 1730 1735 
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aac gcc ttc cgc ctg ttt gtg aca aaa cca gag atg ttt ggc teg tea 
Asn Ala Phe Arg Leu Phe Val Thr Lys Pro Glu Met Phe Gly Ser Ser 
1740 1745 1750 



26158 



act gag gca gta cct gcg cat gat gca ctt tea ctg ate atg ctg acg 26206 
Thr Glu Ala Val Pro Ala His Asp Ala Leu Ser Leu lie Met Leu Thr 
1755 1760 1765 

cgt ttt gca gat tgg gtt aat gcg tta ggc gaa aaa gcc tct tec gta 26254 
Arg Phe Ala Asp Trp Val Asn Ala Leu Gly Glu Lys Ala Ser Ser Val 
1770 1775" 1780 

eta gcg gca ttt gaa get aac agt tta acg gca gaa caa ttg get gat 26302 
Leu Ala Ala Phe Glu Ala Asn Ser Leu Thr Ala Glu Gin Leu Ala Asp 
1785 1790 1795 1800 



gcc atg aat ctt gat get aat ttg eta ttg caa gcc agt act caa gca 
Ala Met Asn Leu Asp Ala Asn Leu Leu Leu Gin Ala Ser Thr Gin Ala 
1805 1810 1815 



26350 



caa aac cat caa cat ctt ccc cca gtg acg caa aaa aat get ttc tec 
Gin Asn His Gin His Leu Pro Pro Val Ihr Gin Lys Asn Ala Phe Ser 
1820 1825 1830 



26398 



tgt tgg aca tct ate gac act ate ctg caa tgg gtt aat gtt gca caa 
Cys Trp Hit Ser lie Asp Thr lie Leu Gin Trp Val Asn Val Ala Gin 
1835 1840 1845 



26446 



caa ttg aat gtc gcc cca cag gga gtt tec get ttg gtc ggg ctg gat 
Gin Leu Asn Val Ala Pro Gin Gly Val Ser Ala Leu Val Gly Leu Asp 
1850 1855 1860 



26494 



tat att caa tta aat caa aaa ate ccc acc tat gcc cag tgg gaa agt 
Tyr He Gin Leu Asn Gin Lys He Pro Thr Tyr Ala Gin Trp Glu Ser 
1865 1870 1875 1880 



26542 



get ggg gaa ata ttg act gcc gga ttg aat tea caa cag get gat ata 26590 
Ala Gly Glu He Leu Thr Ala Gly Leu Asn Ser Gin Gin Ala Asp He 
1885 1890 1895 

tta cac get ttt ttg gac gaa tct cgc agt gcc gca tta age acc tac 26638 
Leu His Ala Phe Leu Asp Glu Ser Arg Ser Ala Ala Leu Ser Thr Tyr 
1900 1905 1910 



tat ate cgt caa gtc gcc aag cca gcg gca gcc ata aaa age cgt gat 
Tyr He Arg Gin Val Ala Lys Pro Ala Ala Ala He Lys Ser Arg Asp 
1915 1920 1925 



26686 



gac ttg tac caa tac tta eta att gat aat cag gtt tec get gca ate 
Asp Leu Tyr Gin Tyr Leu Leu He Asp Asn Gin Val Ser Ala Ala He 
1930 1935 1940 



26734 



aaa act acc egg att gcc gaa gcc att gcc age att caa ctg tac gtc 
Lys Thr Thr Arg He Ala Glu Ala He Ala Ser He Gin Leu Tyr Val 
1945 1950 1955 1960 



26782 



aac cgc acg ctg gaa aat gta gaa gaa aat gcc cat tea ggg gtt ate 
Asn Arg Thr Leu Glu Asn Val Glu Glu Asn Ala His Ser Gly Val He 
1965 1970 1975 



26830 



age cgt cag ttc ttt ate gac tgg gac aaa tat aac aaa cgc tac age 
Ser Arg Gin Phe Phe He Asp Trp Asp Lys Tyr Asn Lys Arg Tyr Ser 
1980 1985 1990 



26878 



acc tgg gcg ggt gtt tct caa tta gtt tac tac ccg gaa aac tat att 26926 



WO 99/42589 

-41 - 



Thr Trp Ala Gly Val Ser Gin Leu Val Tyr Tyr Pro Glu Asn Tyr lie 
1995 2000 2005 

gat ccc acc atg cgt ate gga caa acc aaa atg atg gac gca tta ttg 
Asp Pro Thr Met Arg He Gly Gin Thr Lys Met Met Asp Ala Leu Leu 
2010 2015 2020 

caa tec gtc age caa age caa tta aat gec gat act gtc gaa gac gee 
Gin Ser Val Ser Gin Ser Gin Leu Asn Ala Asp Thr Val Glu Asp Ala 
2025 2030 2035 2040 

ttt atg tct tat ctg aca teg ttt gag caa gtg get aat ctt aaa gtt 
Phe Met Ser Tyr Leu Thr Ser Phe Glu Gin Val Ala Asn Leu Lys Val 
2045 2050 2055 

att age gcg tat cac gat aat att aac aac gat caa ggg ctg acc tat 
He Ser Ala Tyr His Asp Asn He Asn Asn Asp Gin Gly Leu Thr Tyr 
2060 2065 2070 

ttt ate gge etc agt gaa act gat acc ggt gaa tac tat tgg cgc agt 
Phe He Gly Leu Ser Glu Thr Asp Thr Gly Glu Tyr Tyr Trp Arg Ser 
2075 2080 2085 

gtc gat cac agt aaa ttc age gac ggt aaa ttc gee get aat gee tgg 
Val Asp His Ser Lys Phe Ser Asp Gly Lys Phe Ala Ala Asn Ala Trp 
2090 2095 2100 

agt gaa tgg cac aaa att gat tgt cca att aat cct tac cga age act 
Ser Glu Trp His Lys He Asp Cys Pro He Asn Pro Tyr Arg Ser Thr 
2105 2110 2115 2120 

ate cgt cct gtg atg tac aaa tec cgc ttg tat ctg etc tgg ttg gaa 
He Arg Pro Val Met Tyr Lys Ser Arg Leu Tyr Leu Leu Trp Leu Glu 
2125 2130 2135 

caa aag gag ate act aaa caa aca gga aat age aaa gat gge tat caa 
Gin Lys Glu He Thr Lys Gin Thr Gly Asn Ser Lys Asp Gly Tyr Gin 
2140 2145 2150 

acc gag aca gat tat cgt tat gag eta aaa ttg gcg cat ate cgt tat 
Thr Glu Thr Asp Tyr Arg Tyr Glu Leu Lys Leu Ala His He Arg Tyr 
2155 2160 2165 

gac ggt acc tgg aat acg cca ate act ttt gat gtc aat gaa aaa ata 
Asp Gly Thr Trp Asn Ihr Pro He Thr Phe Asp Val Asn Glu Lys He 
2170 2175 2180 

tec aag eta gaa ctg gca aaa aat aaa gcg cct ggg etc tat tgt get 
Ser Lys leu Glu Leu Ala Lys Asn Lys Ala Pro Gly Leu Tyr Cys Ala 
2185 2190 2195 2200 

ggt tat caa ggt gaa gat acg ttg ctg gtt atg ttt tat aac caa caa 
Gly Tyr Gin Gly Glu Asp Thr Leu Leu Val Met Phe Tyr Asn Gin Gin 
2205 2210 2215 

gat aca etc gat agt tat aaa acc get tea atg caa ggg eta tat ate 
Asp Thr Leu Asp Ser Tyr Lys Thr Ala Ser Met Gin Gly Leu Tyr He 
2220 2225 2230 



ttt gee gat atg gaa 
Phe Ala Asp Met Glu 
2235 



aaa gat atg acc gat 
Lys Asp Met Ihr Asp 
2240 

aaa caa ttc gat act 
Lys Gin Phe Asp *Ihr 



gga caa tac aaa tct 
Gly Gin Tyr Lys Ser 
2245 

aat agt gtc aga aga 
Asn Ser Val Arg Arg 



tat 
Tyr 



tat egg gac aac age tat 
Tyr Arg Asp Asn Ser Tyr 
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2250 2255 2260 

gtg aat aac cgc tat gca gag gat tat gaa att ccc tea teg gta aat 27742 
Val Asn Asn Arg Tyr Ala Glu Asp Tyr Glu lie Pro Ser Ser Val Asn 
2265 2270 2275 2280 

age cgt aaa ggc tat gat tgg gga gat tat tat etc agt atg gta tat 27790 
Ser Arg Lys Gly Tyr Asp Trp Gly Asp Tyr Tyr Leu Ser Met Val Tyr 
2285 2290 2295 

aac gga gat att cca act att agt tac aaa gec aca tea agt gat tta 27838 
Asn Gly Asp He Pro Thr lie Ser Tyr Lys Ala Thr Ser Ser Asp Leu 
2300 2305 2310 

aaa ate tat ate teg cca aaa tta aga att att cat aat gga tat gaa 27886 
Lys He Tyr He Ser Pro Lys Leu Arg lie He His Asn Gly Tyr Glu 
2315 2320 2325 

ggg cag caa cgc aat caa tgc aat eta atg aat aaa tat ggc aaa eta 27934 
Gly Gin Gin Arg Asn Gin Cys Asn Leu Met Asn Lys Tyr Gly Lys Leu 
2330 2335 2340 

ggt gat aaa ttt att gtt tat act age ttg gga gtt aat cca aat aat 27982 
Gly Asp Lys Phe He Val Tyr Thr Ser Leu Gly Val Asn Pro Asn Asn 
2345 2350 2355 2360 

teg tea aat aag ctg atg ttt tac ccc gtt tat caa tat aac gga aat 28030 
Ser Ser Asn Lys Leu Met Phe Tyr Pro Val Tyr Gin Tyr Asn Gly Asn 
2365 . 2370 2375 

gtc agt ggg ctt agt caa ggg aga tta eta ttc cac cgt gac acc aat 28078 
Val Ser Gly Leu Ser Gin Gly Arg Leu Leu Phe His Arg Asp Thr Asn 
2380 2385 2390 

tat tea tct aaa gta gaa get tgg att cct gga gca gga cgt tct eta 28126 
Tyr Ser Ser Lys Val Glu Ala Trp He Pro Gly Ala Gly Arg Ser Leu 
2395 2400 2405 

acc aat ccg aat get gec att ggt gat gat tat get aca gac teg tta 28174 
Thr Asn Pro Asn Ala Ala He Gly Asp Asp Tyr Ala Thr Asp Ser Leu 
2410 2415 2420 

aac aaa ccg aat gat ctt aag caa tac gtc tat atg act gac agt aaa 28222 
Asn Lys Pro Asn Asp Leu Lys Gin Tyr Val Tyr Met Thr Asp Ser Lys 
2425 2430 2435 2440 

ggt act get acc gat gtc tea gga cca gta gat ate aat act gca att 28270 
Gly Thr Ala Thr Asp Val Ser Gly Pro Val Asp He Asn Thr Ala He 
2445 2450 2455 

tec ccg gca aaa gtt cag gta aca gta aaa gee ggt age aaa gaa caa 28318 
Ser Pro Ala Lys Val Gin Val Thr Val Lys Ala Gly Ser Lys Glu Gin 
2460 2465 2470 

acg ttt acc gcg gat aaa aat gtc tec att cag cca tec cct age ttt 28366 
Thr Phe Thr Ala Asp Lys Asn Val Ser He Gin Pro Ser Pro Ser Phe 
2475 2480 2485 

gat gaa atg aat tat caa ttt aat get etc gaa ata gat ggc tea agt 28414 
Asp Glu Met Asn Tyr Gin Phe Asn Ala Leu Glu He Asp Gly Ser Ser 
2490 2495 2500 

ctg aat ttt act aac aat tea gec agt att gat att acc ttt acc gca 28462 
Leu Asn Phe Thr Asn Asn Ser Ala Ser He Asp He Thr Phe Thr Ala 
2505 2510 2515 2520 
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ttt gca gag gat gga cgt aaa ctg ggt tat gaa agt ttc agt att cct 
Phe Ala Glu Asp Gly Arg Lys Leu Gly Tyr Glu Ser Phe Ser lie Pro 
2525 2530 2535 



28510 



att acc cgc aag gtg agt act gat aat tec ctg acc ctg cgc cat aat 
He Thr Arg Lys Val Ser Thr Asp Asn Ser Leu Thr Leu Arg His Asn 
2540 2545 2550 



2855B 



gaa aat ggt gcg caa tat atg caa tgg gga gtc tat cgc att cgt ctt 
Glu Asn Gly Ala Gin Tyr Met Gin Trp Gly Val Tyr Arg He Arg Leu 
2555 2560 2565 



28606 



aat act tta ttt get cgc caa tta gtt gcg cga gec act acc ggt att 
Asn Thr Leu Phe Ala Arg Gin Leu Val Ala Arg Ala Thr Thr Gly He 
2570 2575 2580 



28654 



gat acg att ctg agt atg gaa act cag aat att cag gaa cca cag tta 
Asp Thr He Leu Ser Met Glu Thr Gin Asn He Gin Glu Pro Gin Leu 
2585 2590 2595 2600 



28702 



ggc aaa ggt ttc tac get acg ttc gtg ata cct ccg tat aac cca tea 
Gly Lys Gly Phe Tyr Ala Thr Phe Val He Pro Pro Tyr Asn Pro Ser 
2605 2610 2615 



28750 



act cat ggt gat gaa cgt tgg ttt aag ctt tat ate aaa cat gtt gtt 
Ihr His Gly Asp Glu Arg Trp Phe Lys Leu lyr He Lys His Val Val 
2620 2625 2630 



28798 



gat aat aat tea cat att ate tat tea ggt cag eta aaa gat aca aat 
Asp Asn Asn Ser His He He iyr Ser Gly Gin Leu Lys Asp Ihr Asn 
2635 2640 2645 



28846 



ata age acc acg tta ttt ate cct ctt gat gat gtt cca ttg aac caa 
He Ser Thr Thr Leu Phe He Pro Leu Asp Asp Val Pro Leu Asn Gin 
2650 2655 2660 



28894 



gat tac age gec aag gtt tac atg acc ttc aag aaa tea cca tea gat 28942 
Asp Tyr Ser Ala Lys Val Tyr Met Thr Phe Lys Lys Ser Pro Ser Asp 
2665 2670 2675 2680 

ggt acc tgg tgg ggc cct cac ttt gtt aga gat gat aaa gga ata gta 28990 
Gly Thr Trp Trp Gly Pro His Phe Val Arg Asp Asp Lys Gly He Val 
2685 2690 2695 



aca ata aac cct aaa tec att ttg acc cac ttt gag age gtc aat gtc 
Thr He Asn Pro Lys Ser He Leu Thr His Phe Glu Ser Val Asn Val 
2700 2705 2710 



29038 



ctg aat aat att agt age gaa cca atg gat ttc age ggc get aac age 29086 
Leu Asn Asn He Ser Ser Glu Pro Met Asp Phe Ser Gly Ala Asn Ser 
2715 2720 2725 

etc tat ttt tgg gaa ctg ttc tac tat acc ccg atg ctg gtt gee caa 29134 
Leu Tyr Phe Trp Glu Leu Phe Tyr iyr Thr Pro Met Leu Val Ala Gin 
2730 2735 2740 



cgt ttg ttg cat gag caa aac ttt gat gaa gcg aac cgc tgg ctg aaa 29182 
Arg Leu Leu His Glu Gin Asn Phe Asp Glu Ala Asn Arg Trp Leu Lys 
2745 2750 2755 2760 

tat gtc tgg age cca tec ggg tat att gtt cac ggc cag att cag aat 29230 
Tyr Val Trp Ser Pro Ser Gly Tyr He Val His Gly Gin He Gin Asn 
2765 2770 2775 



• 
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tat caa tgg aac gtc cgc ccg tta ttg gaa gat acc agt tgg aac agt 29278 
Tyr Gin Trp Asn Val Arg Pro Leu Leu Glu Asp *Ihr Ser Trp Asn Ser 
2780 2785 2790 

gat cct ttg gat tec gtc gat cct gac gcg gta gcg cag cac gat ccg 29326 
Asp Pro Leu Asp Ser Val Asp Pro Asp Ala Val Ala Gin His Asp Pro 
2795 2800 2805 

atg cac tat aaa gtt tea acc ttt atg cgc acc ctt gat ctg ttg ate 29374 
Met His Tyr Lys Val Ser Thr Phe Met Arg Thr Leu Asp Leu Leu lie 
2810 2815 2820 

gcg cgc ggc gac cat get tac cgc caa ttg gag cgc gat acg ctt aac 29422 
Ala Arg Gly Asp His Ala Tyr Arg Gin Leu Glu Arg Asp Ihr Leu Asn 
2825 2830 2835 2840 

gaa gcg aag atg tgg tat atg caa gcg ctg cat ctg tta ggc gat aaa 29470 
Glu Ala Lys Met Trp Tyr Met Gin Ala Leu His Leu Leu Gly Asp Lys 
2845 2850 2855 

cct tat ctg ccg ctg agt acc aca tgg aat gat cca cga ctg gac aaa 29518 
Pro lyr Leu Pro Leu Ser Thr Thr Trp Asn Asp Pro Arg Leu Asp Lys 
2860 2865 2870 

gee gcg gat att act acc caa agt get cat tec age tea ata gtc get 29566 
Ala Ala Asp lie Thr Thr Gin Ser Ala His Ser Ser Ser lie Val Ala 
2875 2880 2885 

ttg egg cag agt aca cog gcg ctt tta tea ttg cgc age gee aat acc 29614 
Leu Arg Gin Ser Ihr Pro Ala Leu Leu Ser Leu Arg Ser Ala Asn Thr 
2890 2895 2900 

ctg acc gat etc ttc ctg ccg caa ate aat gaa gtg atg atg aat tac 29662 
Leu Ihr Asp Leu Phe Leu Pro Gin He Asn Glu Val Met Met Asn Tyr 
2905 " 2910 2915 2920 

tgg caa aca tta get cag aga gta tac aac ctg cgc cac aac etc tct 29710 
Trp Gin Thr Leu Ala Gin Arg Val Tyr Asn Leu Arg His Asn Leu Ser 
2925 2930 2935 

ate gac ggt cag ccg tta tat ctg cca ate tat gee aca ccg gcg gac 29758 
He Asp Gly Gin Pro Leu Tyr Leu Pro He Tyr Ala Thr Pro Ala Asp 
2940 2945 2950 

ccg aaa gcg tta etc age gee get gtt gee act tct caa ggt gga ggc 29806 
Pro Lys Ala Leu Leu Ser Ala Ala Val Ala Thr Ser Gin Gly Gly Gly 
2955 2960 2965 

aag ctg ccg gag tea ttt atg tec ctg tgg cgt ttc ccg cac atg ctg 29854 
Lys Leu Pro Glu Ser Phe Met Ser Leu Trp Arg Phe Pro His Met Leu 
2970 2975 2980 

gaa aat get cgc age atg gtt age cag etc acc caa ttc ggc tec acg 29902 
Glu Asn Ala Arg Ser Met Val Ser Gin Leu Ihr Gin Phe Gly Ser Thr 
2985 2990 2995 3000 

tta caa aat att ate gaa cgt cag gac gca gaa gcg etc aat gcg tta 29950 
Leu Gin Asn He He Glu Arg Gin Asp Ala Glu Ala Leu Asn Ala Leu 
3005 3010 3015 

tta caa aat cag gee gca gag ctg ata ttg act aac ctg agt att caa 29998 
Leu Gin Asn Gin Ala Ala Glu Leu He Leu Thr Asn Leu Ser He Gin 
3020 3025 3030 

gac aaa acc att gaa gaa ctg gat gee gag aaa acc gtg ctg gaa aaa 30046 
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Asp Lys Thr He Glu Glu Leu Asp Ala Glu Lys Thr Val Leu Glu Lys 
3035 3040 3045 

tec aaa gcg gga gca caa teg cgc ttt gat age tat age aaa ctg cat 30094 
Ser Lys Ala Gly Ala Gin Ser Arg Phe Asp Ser Tyr Ser Lys Leu His 
3050 3055 3060 

gat gaa aac ate aac gee ggt gaa aac caa get atg acg eta cga gcg 30142 
Asp Glu Asn He Asn Ala Gly Glu Asn Gin Ala Met Thr Leu Arg Ala 
3065 3070 3075 3080 

tec gca gee ggg ctt acc acg gcg gtt cag gca tec cgt ctg gee ggc 30190 
Ser Ala Ala Gly Leu Thr Thr Ala Val Gin Ala Ser Arg Leu Ala Gly 
3085 3090 3095 

gca gcg get gat ctg gtg cct aac ate ttc ggc ttc gee ggt ggt ggt 30238 
Ala Ala Ala Asp Leu Val Pro Asn He Phe Gly Phe Ala Gly Gly Gly 
3100 3105 3110 

age cgt tgg ggg get ate get gag gcg acc ggc tat gta atg gaa ttt 30286 
Ser Arg Trp Gly Ala He Ala Glu Ala Thr Gly Tyr Val Met Glu Phe 
3115 3120 3125 

tec get aat gtt atg aat acc gaa gcg gat aaa att age caa tct gaa 30334 
Ser Ala Asn Val Met Asn Thr Glu Ala Asp Lys He Ser Gin Ser Glu 
3130 3135 3140 

acc tac cgt cgt cgc cgt cag gag tgg gaa att cag cgt aat aat gec 30382 
Thr Tyr Arg Arg Arg Arg Gin Glu Trp Glu He Gin Arg Asn Asn Ala 
3145 3150 3155 3160 

gaa gcg gag ctg aaa caa etc gat gee caa ctt aaa teg ctg gca gta 30430 
Glu Ala Glu Leu Lys Gin Leu Asp Ala Gin Leu Lys Ser Leu Ala Val 
3165 3170 3175 

cgc cgt gaa gee gee gta ttg caa aaa acc age ctg aaa acc caa caa 30478 
Arg Arg Glu Ala Ala Val Leu Gin Lys Thr Ser Leu Lys Thr Gin Gin 
3180 3185 3190 

gag cag acc caa gee caa ttg gee ttc ctg caa cgt aag ttc age aat 30526 
Glu Gin Thr Gin Ala Gin Leu Ala Phe Leu Gin Arg Lys Phe Ser Asn 
3195 3200 3205 

caa gcg ttg tac aac tgg eta cgt ggc cga ctg gca gca att tac ttc 30574 
Gin Ala Leu Tyr Asn Trp Leu Arg Gly Arg Leu Ala Ala He Tyr Phe 
3210 3215 3220 

caa ttc tac gac ttg get ate gcg cgt tgt tta atg gca gag cag get 30622 
Gin Phe Tyr Asp Leu Ala He Ala Arg Cys Leu Met Ala Glu Gin Ala 
3225 3230 3235 3240 

tac cgt tgg gaa att age gat gac tct get cgc ttt att aaa ccg ggc 30670 
Tyr Arg Trp Glu He Ser Asp Asp Ser Ala Arg Phe He Lys Pro Gly 
3245 3250 3255 

gee tgg caa gga acc tat gca ggt ctg ctg gca ggt gaa acc ttg atg 30718 
Ala Trp Gin Gly Thr Tyr Ala Gly Leu Leu Ala Gly Glu Thr Leu Met 
3260 3265 * 3270 

eta agt ttg gca caa atg gaa gac gee cat tta aga cgc gat aaa cgc 30766 
Leu Ser Leu Ala Gin Met Glu Asp Ala His Leu Arg Arg Asp Lys Arg 
3275 3280 3285 

gca tta gag gtc gaa cgt aca gta teg ctg gec gaa att tat get ggt 30814 
Ala Leu Glu Val Glu Arg Thr Val Ser Leu Ala Glu He Tyr Ala Gly 



WO 99/42589 PCT/EP99/01015 

-46 - 



3290 3295 3300 

eta ccg caa gat aaa ggc cca ttc tec ctg acg caa gaa ate gag aag 
Leu Pro Gin Asp Lys Gly Pro Phe Ser Leu Thr Gin Glu lie Glu Lys 
3305 3310 3315 3320 

ctg gtg aat gca ggt tea ggc age gee ggc agt ggt aat aat aat ttg 
Leu Val Asn Ala Gly Ser Gly Ser Ala Gly Ser Gly Asn Asn Asn Leu 
3325 3330 3335 

gca ttt ggc gec ggc acg gac act aaa act tct ttg cag gca tec att 
Ala Phe Gly Ala Gly Thr Asp Thr Lys Thr Ser Leu Gin Ala Ser lie 
3340 3345 3350 

tea tta get gat tta aaa att cgt gag gat tac ccg gaa tct att ggc 
Ser Leu Ala Asp Leu Lys lie Arg Glu Asp Tyr Pro Glu Ser He Gly 
3355 3360 3365 

aaa ate cga cgc ate aaa cag ate age gtt ace ctg ccg gcg eta ttg 
Lys He Arg Arg He Lys Gin He Ser Val Thr Leu Pro Ala Leu Leu 
3370 3375 3380 

gga cct tat cag gat gtg cag gca ata tta tct tac ggc gat aaa gee 
Gly Pro Tyr Gin Asp Val Gin Ala He Leu Ser Tyr Gly Asp Lys Ala 
3385 3390 3395 3400 

gga tta gcg aac ggc tgt gca gcg ctg gee gtt tec cac ggt acg aat 
Gly Leu Ala Asn Gly Cys Ala Ala Leu Ala Val Ser His Gly Thr Asn 
3405 3410 3415 

gac age ggt caa ttc cag etc gat ttc aac gat ggc aaa ttc ctg ccg 
Asp Ser Gly Gin Phe Gin Leu Asp Phe Asn Asp Gly Lys Phe Leu Pro 
3420 3425 3430 

ttt gaa ggt ate gec att gat caa ggt acg eta aca ctg agt ttt cct 
Phe Glu Gly He Ala He Asp Gin Gly Thr Leu Thr Leu Ser Phe Pro 
3435 3440 3445 

aat gca tea acg cca gee aaa ggt aaa caa gee act atg tta aaa ace 
Asn Ala Ser Thr Pro Ala Lys Gly Lys Gin Ala Thr Met Leu Lys Thr 
3450 3455 3460 

ctg aac gat ate att ttg cat att cgc tac ace att aag taa 
Leu Asn Asp He He Leu His He Arg Tyr Thr He Lys 
3465 3470 3475 



30862 



30910 



30958 



31006 



31054 



31102 



31150 



31198 



31246 



31294 



31336 



ccatcccaac acagaactaa gacaggcccc gaatcggggt ctggtaagga gtttct atg 31395 

Met 

cag aat tea cag aca ttc age atg ace gag ctg tea tta cct aag ggc 31443 
Gin Asn Ser Gin Thr Phe Ser Met Thr Glu Leu Ser Leu Pro Lys Gly 
3480 3485 3490 3495 

ggc ggc gee att ace ggt atg ggt gaa gca tta acg ccg gee ggg ccg 31491 
Gly Gly Ala He Thr Gly Met Gly Glu Ala Leu Thr Pro Ala Gly Pro 
3500 3505 3510 

gat ggt atg gca gee tta teg ctg cca ttg ccc att tct gee gga cgt 31539 
Asp Gly Met Ala Ala Leu Ser Leu Pro Leu Pro He Ser Ala Gly Arg 
3515 3520 3525 



ggt tat gec ccc teg etc acg ctg aac tac aac age gga acc ggt aac 
Gly Tyr Ala Pro Ser Leu Thr Leu Asn Tyr Asn Ser Gly Thr Gly Asn 
3530 3535 3540 



31587 
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agc ccg ttc ggt etc ggt tgg gac tgt aac gtc atg aca att cgt cgt 31635 
Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr lie Arg Arg 
3545 3550 3555 

cgc acc agt acc ggc gtg ccg aat tat gat gaa acc gat act ttt ctg 31683 
Arg Thr Ser Thr Gly Val Pro Asn Tyr Asp Glu Thr Asp Thr Phe Leu 
3560 3565 3570 3575 

ggg ccg gaa ggt gaa gtg ttg gtc gta gca tta aat gag gca ggt caa < 31731 
Gly Pro Glu Gly Glu Val Leu Val Val Ala Leu Asn Glu Ala Gly Gin 
3580 3585 3590 

get gat ate cgc agt gaa tec tea tta cag ggc ate aat ttg ggg atg 31779 
Ala Asp He Arg Ser Glu Ser Ser Leu Gin Gly He Asn Leu Gly Met 
3595 3600 " 3605 

acc ttc acc gtt acc ggt tat cgc tec cgt ttg gaa age cac ttt age 31827 
Thr Phe Thr Val Thr Gly Tyr Arg Ser Arg Leu Glu Ser His Phe Ser 
3610 3615 3620 

egg ttg gaa tac tgg caa ccc caa aca aca ggc gca acc gat ttc tgg 31875 
Arg Leu Glu Tyr Trp Gin Pro Gin Thr Thr Gly Ala Thr Asp Phe Trp 
3625 3630 3635 

ctg ata tac age ccc gac gga caa gee cat tta ctg ggc aaa aat cct 31923 
Leu He Tyr Ser Pro Asp Gly Gin Ala His Leu Leu Gly Lys Asn Pro 
3640 3645 3650 3655 

caa gca cgc ate age aat cca eta aat gtt aac caa aca gcg caa tgg 31971 
Gin Ala Arg He Ser Asn Pro Leu Asn Val Asn Gin Thr Ala Gin Trp 
3660 3665 3670 

eta ttg gaa gee teg gta tea tec cac ggc gag cag att tat tat cag 32019 
Leu Leu Glu Ala Ser Val Ser Ser His Gly Glu Gin He Tyr Tyr Gin 
3675 3680 3685 

tat cga gee gaa gat gaa act gat tgc gaa act gac gaa etc aca gee 32067 
Tyr Arg Ala Glu Asp Glu Thr Asp Cys Glu Thr Asp Glu Leu Thr Ala 
3690 3695 3700 

cac ccg aac aca acc gtc cag cgc tac ctg caa gta gta cat tac ggt 32115 
His Pro Asn Thr Thr Val Gin Arg Tyr Leu Gin Val Val His TVr Gly 
3705 3710 3715 

aat eta acc gec age gaa gta ttt ccc acg eta aat gga gat gat cca 32163 
Asn Leu Thr Ala Ser Glu Val Phe Pro Thr Leu Asn Gly Asp Asp Pro 
3720 3725 3730 3735 

etc aaa tct ggc tgg ttg ttc tgt tta gta ttt gat tac ggt gag cgc 32211 
Leu Lys Ser Gly Trp Leu Phe Cys Leu Val Phe Asp Tyr Gly Glu Arg 
3740 3745 3750 

aaa aac age tta tct gaa atg ccg cca ttt aaa gee aca agt aac tgg 32259 
Lys Asn Ser Leu Ser Glu Met Pro Pro Phe Lys Ala Thr Ser Asn Trp 
3755 3760 3765 

ctt tgc cgc aaa gac cgt ttt tec cgt tat gaa tac ggt ttt gca ttg 32307 
Leu Cys Arg Lys Asp Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Ala Leu 
3770 3775 3780 

cgc acc egg cgc tta tgt cgc caa ata ctg atg ttt cac cgt ctg caa 32355 
Arg Thr Arg Arg Leu Cys Arg Gin He Leu Met Phe His Arg Leu Gin 
3785 3790 3795 

acc ctg tct ggt cag gca aaa ggc gac gat gaa ccc gca tta gtt tea 32403 
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Thr Leu Ser Gly Gin Ala Lys Gly Asp Asp Glu Pro Ala Leu Val Ser 
3800 3805 3810 3815 

cat ctg ata ctg gat tat gac gaa aac gcg gtg gtc agt acg etc gtt 32451 
Arg Leu He Leu Asp Tyr Asp Glu Asn Ala Val Val Ser Thr Leu Val 
3820 3825 3830 

tct gtc cgc cga gtg gga cat gag caa gat ggc aca acg gcg gtc gec 32499 
Ser Val Arg Arg Val Gly His Glu Gin Asp Gly Thr Thr Ala Val Ala 
3835 3840 3845 

ctg ccg cca ttg gaa ctg get tat cag cct ttt gaa cca gaa caa aaa 
Leu Pro Pro Leu Glu Leu Ala Tyr Gin Pro Phe Glu Pro Glu Gin Lys 
3850 3855 3860 

gca etc tgg cga cca atg gat gta ctg gcg aat ttc aac ace ate caa 
Ala Leu Trp Arg Pro Met Asp Val Leu Ala Asn Phe Asn Thr He Gin 
3865 3870 3875 

cgc tgg caa ctg ctt gat ctg caa ggc gaa ggc gta ccc ggt att ctg 32643 
Ara Trp Gin Leu Leu Asp Leu Gin Gly Glu Gly Val Pro Gly He leu 
SSO 3885 3890 3895 



32547 



32595 



tat cag gat aaa aat ggc tgg tgg tat cga tct get caa cgt cag aca 32691 
Tvr Gin Asp Lys Asn Gly Trp Trp Tyr Arg Ser Ala Gin Arg Gin Thr 
3900 3905 3910 

ggg gaa gag atg aat gcg gtc ace tgg ggc aaa atg caa etc ctt cct 32739 
Gly Glu Glu Met Asn Ala Val Thr Trp Gly Lys Met Gin leu leu Pro 
3915 3920 3925 

ate acg ccc get att cag gat aac gee tea ctg atg gat att aat ggt 32787 
He Thr Pro Ala He Gin Asp Asn Ala Ser Leu Met Asp He Asn Gly 
3930 3935 3940 

gat ggg caa ctg gat tgg gtt ate acc ggt ccg ggg eta agg ggt tat 32835 
Asp Gly Gin Leu Asp Trp Val He Thr Gly Pro Gly leu Arg Gly Tyr 
3945 3950 3955 

cac age cag cat cca gat ggc agt tgg aca cgt ttt acg ccg ttg cac 32883 
His Ser Gin His Pro Asp Gly Ser Trp Thr Arg Phe Thr Pro Leu His 
3960 3965 3970 3975 

gee tta ccg ata gaa tat acc cat ccc cgc gee caa ctt gcg gat tta 32931 
Ala leu Pro lie Glu Tyr Thr His Pro Arg Ala Gin leu Ala Asp leu 
3980 3985 3990 



atg ggg gee ggg ctg tec gat tta gtg ctg att ggt ccc aaa age gtg 
Met Gly Ala Gly Leu Ser Asp Leu Val Leu He Gly Pro Lys Ser Val 
3995 4000 4005 

cgt ttg tat gec aat aac cgt gat ggt ttt acc gaa gga egg gat gtg 
Arg Leu Tyr Ala Asn Asn Arg Asp Gly Phe Thr Glu Gly Arg Asp Val 
4010 4015 4020 



32979 



33027 



gtg caa tec ggt ggt ate acc ctg ccg tta ccg ggc gec gat gcg cgt 33075 
Val Gin Ser Gly Gly He Thr Leu Pro Leu Pro Gly Ala Asp Ala Arg 
4025 4030 4035 

aag tta gtg gee ttt age gac gta etc ggt tea ggc caa gca cat ttg 33123 
Lys Leu Val Ala Phe Ser Asp Val Leu Gly Ser Gly Gin Ala His Leu 
4040 4045 4050 4055 



gtt gaa gtt agt gcg acg aaa gtc acc tgc tgg cca aat ctg gga cat 
Val Glu Val Ser Ala Thr Lys Val Thr Cys Trp Pro Asn Leu Gly His 



33171 
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4060 4065 4070 

ggc cgt ttt ggt cag cca ate aca ttg ccg gga ttt age caa tec gee 33219 
Gly Arg Phe Gly Gin Pro lie Tnr Leu Pro Gly Phe Ser Gin Ser Ala 
4075 4080 4085 

gee aat ttt aat cct gat cga gtt cat ctg gee gat ctg gac ggt agt 33267 
Ala Asn Phe Asn Pro Asp Arg Val His Leu Ala Asp Leu Asp Gly Ser 
4090 4095 4100 

ggt cct gec gat ctg att tat gtt cat get gac cat ctg gat att ttc 33315 
Gly Pro Ala Asp Leu lie Tyr Val His Ala Asp His Leu Asp lie Phe 
4105 4110 4115 

age aat gaa agt ggt aac ggt ttt gca caa cca ttc aca etc cgt ttt 33363 
Ser Asn Glu Ser Gly Asn Gly Phe Ala Gin Pro Phe Thr Leu Arg Phe 
4120 4125 4130 4135 

cct gac ggc ctg cgt ttt gat gat act tgc cag eta caa gtg get gat 33411 
Pro Asp Gly Leu Arg Phe Asp Asp Thr Cys Gin Leu Gin Val Ala Asp 
4140 4145 4150 

gta cag gga tta ggg gtt gtc age ctg ate ctg age gta ccg cat atg 33459 
Val Gin Gly Leu Gly Val Val Ser Leu He Leu Ser Val Pro His Met 
4155 4160 4165 

gcg cca cac cat tgg cgc tgc gat ctg ace aac gcg aaa ccg tgg tta 33507 
Ala Pro His His Trp Arg Cys Asp Leu Thr Asn Ala Lys Pro Trp Leu 
4170 4175 ' 4180 

etc agt gaa atg aac aac aac atg gga gee cat cac ace ctg cat tac 33555 
Leu Ser Glu Met Asn Asn Asn Met Gly Ala His His Thr Leu His Tyr 
4185 4190 4195 

cgt age tec gtc cag ttt tgg ctg gat gaa aaa gee gca gee tta get 33603 
Arg Ser Ser Val Gin Phe Trp Leu Asp Glu Lys Ala Ala Ala Leu Ala 
4200 4205 4210 4215 

acc gga caa aca ccg gtc tgt tac ctg ccc ttc ccg gtc cat acc ctg 33651 
Thr Gly Gin Thr Pro Val Cys Tyr Leu Pro Phe Pro Val His Thr Leu 
4220 4225 4230 

tgg caa aca gaa acc gag gat gaa ate age ggc aat aaa tta gtg acc 33699 
Trp Gin Thr Glu Thr Glu Asp Glu He Ser Gly Asn Lys Leu Val Thr 
4235 4240 4245 

act tta cgt tac get cac ggc gee tgg gat gga cgt gag egg gaa ttt 33747 
Thr Leu Arg Tyr Ala His Gly Ala Trp Asp Gly Arg Glu Arg Glu Phe 
4250 4255 4260 

cgc ggc ttt ggc tat gtt gag cag aca gac age cat caa ctg get caa 33795 
Arg Gly Phe Gly Tyr Val Glu Gin Ihr Asp Ser His Gin Leu Ala Gin 
4265 4270 4275 

ggc aat gcg ccg gaa cgt aca tea ccg gca ctt acc aaa aac tgg tat 33843 
Gly Asn Ala Pro Glu Arg Thr Ser Pro Ala Leu Thr Lys Asn Trp Tyr 
4280 4285 4290 4295 

gee acc gga ate cct gag gta gac aat acg eta tct gec ggg tat tgg 33891 
Ala Thr Gly He Pro Glu Val Asp Asn Thr Leu Ser Ala Gly Tyr Trp 
4300 4305 4310 

cgc ggt gat acg cag get ttc act ggt ttt acg cca cac ttt act etc 33939 
Arg Gly Asp Thr Gin Ala Phe Thr Gly Phe Tnr Pro His Phe Thr Leu 
4315 4320 4325 
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tqq aaa gag ggc aaa gat gtt cca ctg aca ccg gaa gat gac cac aat 33987 
Trp Lys Glu Gly Lys Asp Val Pro Leu Thr Pro Glu Asp Asp His Asn 
4330 4335 4340 

ctg tac tgg tta aac egg gca eta aaa ggt caa cca ctg cgt agt gaa 
Leu Tyr Trp Leu Asn Arg Ala Leu Lys Gly Gin Pro Leu Arg Ser Glu 
4345 4350 4355 

etc tac ggg eta gat ggc age gca cag cag aag ate ccc tat aca gtg 
Leu Tyr Gly Leu Asp Gly Ser Ala Gin Gin Lys lie Pro Tyr Thr Val 
4360 4365 4370 4375 

act gaa tec cgc cca caa gtg cgc caa tta caa gat aac act ace ctt 34131 
Thr Glu Ser Arg Pro Gin Val Arg Gin Leu Gin Asp Asn Thr Thr Ifiu 
4380 4385 4390 



34035 



34083 



34227 



34275 



34323 



tec ccg gtg etc tgg gee tea gtg gtg gaa agt cgt agt tat cac tat 34179 
Ser Pro Val Leu Trp Ala Ser Val Val Glu Ser Arg Ser Tyr His Tyr 
4395 4400 4405 

gaa cgt ate ate age gat ccc caa tgc aat cag gat ate act ctg tec 
Glu Arg He lie Ser Asp Pro Gin Cys Asn Gin Asp lie Thr Leu Ser 
4410 4415 4420 

agt gac eta ttc ggg caa ccg ctg aaa cag gtt tea gtg caa tat ccc 
Ser Asp Leu Phe Gly Gin Pro Leu Lys Gin Val Ser Val Gin Tyr Pro 
4425 4430 4435 

cgc cgc aat aaa cca aca ace aat ccg tat ccc gat aca eta cca gat 
Arg Arg Asn Lys Pro Thr Thr Asn Pro iyr Pro Asp Thr Leu Pro Asp 
4440 4445 4450 4455 

act ctg ttt gee age agt tat gac gac caa caa caa eta ttg egg tta 34371 
Ihr Leu Phe Ala Ser Ser Tyr Asp Asp Gin Gin Gin Leu I^u Arg Leu 
4460 4465 4470 

ace tac cag caa tec agt tgg cat cat eta att get aat gaa etc aga 34419 
Thr Tyr Gin Gin Ser Ser Trp His His Leu He Ala Asn Glu lea Arg 
4475 4480 4485 

gtg tta gga tta ccg gat ggt aca cgc agt gat get ttc act tac gat 
Val Leu Gly Leu Pro Asp Gly Ihr Arg Ser Asp Ala Phe Thr Tyr Asp 
4490 4495 4500 

get aaa cac gtg cct gtt gat ggt tta aat ctg gaa get eta tgt get 
Ala Lys His Val Pro Val Asp Gly Leu Asn Leu Glu Ala Leu Cys Ala 
4505 4510 4515 

gaa aat age ctg att gee gat gat aaa cct cgc gaa tac etc aac cag 
Glu Asn Ser Leu He Ala Asp Asp Lys Pro Arg Glu Tyr lea Asn Gin 
4520 4525 4530 4535 

caa cga acg ttc tat ace gat ggg aaa acc gat gga aaa aat cca acg 34611 
Gin Arg Thr Phe Tyr Thr Asp Gly Lys Thr Asp Gly Lys Asn Pro Thr 
4540 4545 4550 



34467 



34515 



34563 



cca ctg aaa aca ccg aca cga cag get tta ate gec ttt acc gaa acg 
Pro Leu Lys Thr Pro Thr Arg Gin Ala Leu He Ala Phe Thr Glu Thr 
4555 4560 4565 

gcg gta tta acg gaa tct ctg tta tec gca ttt gat ggc ggt ate acg 
Ala Val Leu Thr Glu Ser Leu Leu Ser Ala Phe Asp Gly Gly He Thr 
4570 4575 4580 



34659 



34707 
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oca gat gaa tta ccc ggc ctt ctg aca caa gca gga tac caa caa gaa 34755 
Pro Asp Glu Leu Pro Gly Leu Leu Thr Gin Ala Gly Tyr Gin Gin Glu 



cct tat ctg ttc cca etc agt ggc gaa aac caa gtc tgg gta gca cgc 34803 
Pro Tyr Leu Phe Pro Leu Ser Gly Glu Asn Gin Val Trp Val Ala Arg 
4600 4605 4610 4615 

aaa ggc tat acc gat tac gga act gag gta caa ttt tgg cgt cct gtc 34851 
Lys Gly Tyr Thr Asp Tyr Gly Thr Glu Val Gin Phe Trp Arg Pro Val 
4620 4625 4630 

gca caa cgt aac acc cag tta acc ggg aaa acg act eta aaa tgg gat 34899 
Ala Gin Arg Asn Thr Gin Leu Thr Gly Lys Thr Thr Leu Lys Trp Asp 
4635 4640 4645 

acc cac tac tgt gtc ate act caa acc caa gac gcg get ggt ttg act 34947 
Thr His Tyr Cys Val lie Thr Gin Thr Gin Asp Ala Ala Gly Leu Thr 
4650 4655 4660 

gtc tea gee aat tat gac tgg cgt ttt etc aca cct atg caa ctg act 34995 
Val Ser Ala Asn Tyr Asp Trp Arg Phe Leu Thr Pro Mat Gin Leu Thr 
4665 4670 4675 

gat ate aac gat aat gtg cat ate ata acc ttg gat gcg eta gga cgc 35043 
Asp He Asn Asp Asn Val His He He Thr Leu Asp Ala Leu Gly Arg 
4680 4685 4690 4695 

cct gtc act caa cgt ttc tgg gga ate gaa aat ggt gtg gca aca ggt 35091 
Pro Val Thr Gin Arg Phe Trp Gly He Glu Asn Gly Val Ala Thr Gly 
4700 4705 4710 

tac tct tea cca gaa gca aaa cca ttc act cca cca gtc gat gtc aat 35139 
Tyr Ser Ser Pro Glu Ala Lys Pro Phe Thr Pro Pro Val Asp Val Asn 
4715 4720 4725 

get gee att get ctg acc gga cca etc cct gtc gcg cag tgt ctg gtc 35187 
Ala Ala He Ala Leu Thr Gly Pro Leu Pro Val Ala Gin Cys Leu Val 
4730 4735 4740 

tat gcg ccg gac agt tgg atg ccg eta ttc ggt cag gaa acc ttc aac 35235 
Tyr Ala Pro Asp Ser Trp Met Pro Leu Phe Gly Gin Glu Thr Phe Asn 
4745 4750 4755 

aca tta acg cag gaa gag caa aag aca ctg cgt gat tta egg att ate 35283 
Thr Leu Thr Gin Glu Glu Gin Lys Thr Leu Arg Asp Leu Arg He He 
4760 4765 4770 4775 

aca gaa gat tgg cgt att tgc gca ctg get cgc cgc cgt tgg eta caa 35331 
Thr Glu Asp Trp Arg He Cys Ala Leu Ala Arg Arg Arg Trp Leu Gin 
4780 4785 4790 

agt caa aaa gee ggc aca cca ttg gtt aag ctg tta acc aac age ate 35379 
Ser Gin Lys Ala Gly Thr Pro Leu Val Lys Leu Leu Thr Asn Ser He 
4795 4800 4805 

ggt tta cct ccc cac aac etc atg ctg get acg gac cgt tat gac cgt 35427 
Gly Leu Pro Pro His Asn Leu Met Leu Ala Thr Asp Arg Tyr Asp Arg 
4810 4815 4820 

gat tct gaa cag caa att cgt caa caa gtc gca ttc agt gat ggt ttt 35475 
Asp Ser Glu Gin Gin He Arg Gin Gin Val Ala Phe Ser Asp Gly Phe 
4825 4830 4835 

ggc cgt ttg ttg caa gcg get gtg egg cat gag gca ggc gaa gec tgg 35523 



4585 



4590 



4595 
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Gly Arg Leu Leu Gin Ala Ala Val Arg His Glu Ala Gly Glu Ala Trp 
4840 4845 4850 4855 

caa cgt aac caa gac ggt tct ctg gtg aca aaa atg gaa gat acc aaa 
Gin Arg Asn Gin Asp Gly Ser Leu Val Thr Lys Met Glu Asp Ihr Lys 
4860 4865 4870 

acg cgc tgg gcg att acg gga cgc act gaa tat gac aat aag ggg cag 
Thr Arg Trp Ala He Ihr Gly Arg Thr Glu Tyr Asp Asn Lys Gly Gin 
4875 4880 4885 

gcg ata cga act tat cag ccc tat ttc etc aat gac tgg cga tat gtg 
Ala He Arg Thr Tyr Gin Pro Tyr Phe Leu Asn Asp Trp Arg Tyr Val 
4890 4895 4900 

agt gat gac age gec aga aaa gag gec tat gec gat act cat ate tat 
Ser Asp Asp Ser Ala Arg Lys Glu Ala iyr Ala Asp Ihr His He Tyr 
4905 4910 4915 

gat cog att ggg egg gaa ate caa gtt ate acg gca aaa ggc tgg ctg 
Asp Pro He Gly Arg Glu He Gin Val He Thr Ala Lys Gly Trp Leu 
4920 4925 4930 4935 

egg cag aac caa tat ttc ccg tgg ttt acc gtg agt gaa gat gaa aat 
Arg Gin Asn Gin Tyr Phe Pro Trp Phe Thr Val Ser Glu Asp Glu Asn 
4940 4945 4950 

gat ttg tec get gac gcg . etc gtg taa ttgaatcaag attegctegt 
Asp Leu Ser Ala Asp Ala Leu Val 

4955 4960 

ttaatgttaa egagegaata taatatacct aatagatttc gagttgeage gcggcggcaa 35918 

gtgaacgaat ccccaggagc atagataact atgtgactgg ggtgagtgaa agcagccaac 35978 

aaagcagcag cttgaaagat gaagggtata aataagaaac tgcattgtga gttctaaata 36038 

gagtagcagc atattttatt gecttttatt tcataggtaa taaaattcaa ttgctgtaaa 36098 

aatctgtcat catgagaact aaaaataaca actttctctt ctgeaagaga aatcaataat 36158 

tcaattaaaa atgttataga atctgaatca agaccatttg ttggctcatc aaaaatataa 36218 

acatccgcat eggtaataaa agctgatgtc aatagaaatt tcttttttat cocaagtgac 36278 

atatgtccat actcaatacc agaataatta gatataccaa aaccatttaa atagtaatct 36338 ' 

aattgatatt ttaaattact tttcctataa cgctgactta aattaatcac atccattccc 36398 

gtgatgaaat tataaaagtt aacattatcc gatagataaa aaccatgetg ttgeaaatta 36458 

aateggctet tttctccctt ttttataaaa ttaaccattc cttttttaac cttatttaca 36518 

ccagcaatac ttgaaagaaa agtcgtttta cccgccecat taactcccgc aatacggttt 36578 

aatccaaccc gaaaatcaca attgactcct gaaaaaatag tcttaccatt aataacaacc 36638 

tctaacccaa taacttcaag cataaataac ccctaaaaat aaegtaaaaa agaaaataac 36698 

accaacaata ataattttcg tgtattgcgt tctcaacaga gaaatagaag aaacaataat 36758 

agaaagaaaa gcataagata aaaatataat cacaggaaaa gatttaacaa caagaaagca 36818 

aaacaaagca aataaaaaaa caaagaaata ccataattaa aaaagaatat 36878 



35571 
35619 
35667 
35715 
35763 
35B11 
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tttccgcaca gataaaaagt tggacaaata tgaaagataa tttatttcaa tatatgatag 36938 
attataaaat aacaacatgc atatatataa aacaacactg gcatatatta atgatatata 36998 
atcagccttg tttgggattt gagaaaaggc actttcacat aatagatata aaagcagaac 37058 
agataatgcc ataataagca cagacatttt atttttatta aaacaataac gaagattcat 37118 
tatataaggc aatgaaaaaa aacctgatga aaataatttt ttatttctat taattatata 37178 
acatggtgtg aaattcaaat: ataatatcaa tgctactaat ggaataacta atgtaaaaat 37238 
caaatcatat aatattccac tcctgaatga tgccgccaga agaaagaaca cagcaacaat 37298 
aaaaaaatgc aaaaaactta attcaaataa gcaaaatcca attacagcaa aagaaactat 37358 
caaaaaaaac acagatgaaa ggtaatgcaa ataattaaca ttttcgtaaa aaaaacctat 37418 
aaagaagaaa ataactatcg gaaaagcact ataaataaaa aaaaogatac gactaaaaaa 37478 
caacgttttt ttacctacca aagaaacgat gattgaattc tcctttgcag aaggaaaaaa 37538 
ccttatgtta atcaaataaa ataccatata taccattaaa gatatggcag t-aaaafaaaa 37598 
tgattttatg tagccatctg gaataataat attggaagat aaagttatta aaacctcaaa 37658 
gataccactg aactttgccg gaagtaataa aagaaaaagg aatataatga catttttatt 37718 
cccagacgca aatttcttta tcctaccttt atattccaag gcatcagcga ttattaaatt 37778 
catactgcct ctctaaaacc aaaatctaaa taatgtcctt ggtgaatctt tagggaattt 37838 
cgtcctggaa tgcaaatata aatagttact gaaaacaata cattgatttt taattaaata 37898 
ctggcgatat gaccttaatg atgctacttt attttccagt attcaattcg 37948 

<210> 12 
<211> 954 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 12 

Met Lys Asn He Asp Pro Lys Leu Tyr Gin Lys Thr Pro Val Val Asn 
1 5 10 15 

He Tyr Asp Asn Arg Gly Leu Thr He Arg Asn He Asp Phe His Arg 
20 25 30 

Thr Thr Ala Asn Gly Asp Thr Asp He Arg He Thr Arg His Gin Tyr 
35 40 45 

Asp Ser Leu Gly His Leu Ser Gin Ser Thr Asp Pro Arg Leu Tyr Glu 
50 55 60 

Ala Lys Gin Lys Ser Asn Phe Leu Trp Gin Tyr Asp Leu Thr Gly Asn 



He Leu Cys Thr Glu Ser Val Asp Ala Gly Arg Thr Val Thr Leu Asn 
85 90 95 

Asp He Glu Gly Arg Pro Leu Leu Thr Val Thr Ala Thr Gly Val He 
100 105 110 

Gin Thr Arg Gin Tyr Glu Thr Ser Ser Leu Pro Gly Arg Leu Leu Ser 



65 



70 



75 



80 
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115 



120 



125 



Val Thr Glu Gin He Pro Glu Lys Thr Ser Arg He Thr Glu Arg leu 
130 135 140 

He Trp Ala Gly Asn Ser Glu Ala Glu Lys Asn His Asn Leu Ala Ser 
145 150 155 160 

Gin Cys Val Arg His iyr Asp Thr Ala Gly Val Thr Arg Leu Glu Ser 
165 170 175 

Leu Ser Leu Thr Gly Thr Val Leu Ser Gin Ser Ser Gin Leu Leu Ser 
180 185 ' 190 

Asp Thr Gin Glu Ala Ser Trp Thr Gly Asp Asn Glu Thr Val Trp Gin 
195 200 205 

Asn Met Leu Ala Asp Asp He Tyr Thr Thr Leu Ser Ala Phe Asp Ala 
210 215 220 

Thr Gly Ala Leu Leu Thr Gin Thr Asp Ala Lys Gly Asn He Gin Arg 
225 230 235 240 

Leu Thr iyr Asp Val Ala Gly Gin Leu Asn Gly Ser Trp Leu Thr Leu 
245 250 255 

Lys Asp Gin Pro Glu Gin Val He He Arg Ser Leu Thr iyr Ser Ala 
260 265 270 

Ala Gly Gin Lys Leu Arg Glu Glu His Gly Asn Gly Val He Thr Glu 
275 280 285 

Tyr Ser Tyr Glu Pro Glu Thr Gin Gin Leu He Gly Thr Lys Thr His 
290 295 300 

Arg Pro Ser Asp Ala Lys Val Leu Gin Asp Leu Arg Tyr Glu Tyr Asp 
305 310 315 320 

Pro Val Gly Asn Val He Ser He Arg Asn Asp Ala Glu Ala Thr Arg 
325 330 335 

Phe Trp His Asn Gin Lys Val Ala Pro Glu Asn Thr Tyr Thr Tyr Asp 
340 345 350 

Ser Leu Tyr Gin Leu He Ser Ala Thr Gly Arg Glu Met Ala Asn He 
355 360 365 1 

Gly Gin Gin Ser Asn Gin Leu Pro Ser Leu Thr Leu Pro Ser Asp Asn 
370 375 380 

Asn Thr Tyr Thr Asn Tyr Thr Arg Thr Tyr Thr Tyr Asp Arg Gly Gly 
385 390 395 400 

Asn Leu Thr Lys He Gin His Ser Ser Pro Ala Thr Gin Asn Asn Tyr 
405 " 410 415 

Thr Thr Asn He Thr Val Ser Asn Arg Ser Asn Arg Ala Val Leu Ser 
420 425 430 

Thr Leu Thr Glu Asp Pro Ala Gin Val Asp Ala Leu Phe Asp Ala Gly 
435 440 445 

Gly His Gin Asn Thr Leu He Ser Gly Gin Asn Leu Asn Trp Asn Thr 



450 



455 



460 
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Arg Gly Glu Leu Gin His Val Thr Leu Val Lys Arg Asp Lys Gly Ala 
465 470 475 480 

Asn Asp Asp Arg Glu Trp Tyr Arg Tyr Ser Ser Asp Gly Arg Arg lie 
485 490 495 

Leu Lys lie Asn Glu Gin Gin Thr Ser Ser Asn Ser Gin Thr Gin Arg 
500 505 510 

lie Thr iyr Leu Pro Ser Leu Glu Leu Arg Leu Thr Gin Asn Ser Thr 
515 520 525 

lie Thr Thr Glu Asp Leu Gin Val lie Thr Val Gly Glu Ala Gly Arg 
530 535 540 

Ala Gin Val Arg Val Leu His Trp Asp Ser Gly Gin Pro Glu Asp lie 
545 550 555 560 

Asp Asn Asn Gin Leu Arg Tyr Ser Tyr Asp Asn Leu He Gly Ser Ser 
565 570 575 

Gin Leu Glu Leu Asp Ser Lys Gly Glu He He Ser Glu Glu Glu Tyr 
580 585 590 

Tyr Pro Tyr Gly Gly Thr Ala Leu Trp Ala Thr Arg Lys Arg Thr Glu 
595 600 605 

Ala Ser Tyr Lys Thr He Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr 
610 615 620 

Gly Leu Tyr Tyr Tyr Gly Tyr Arg Tyr Tyr Gin Pro Trp Val Gly Arg 
625 630 635 640 

Trp Leu Ser Ala Asp Pro Ala Gly Thr Val Asp Gly Leu Asn Leu Tyr 
645 650 655 

Arg Met Val Arg Asn Asn Pro Val Thr Leu Leu Asp Pro Asp Gly Leu 
660 665 670 

Met Pro Thr He Ala Glu Arg He Ala Ala Leu Gin Lys Asn Lys Val 
675 680 685 

Ala Asp Ser Ala Pro Ser Pro Thr Asn Ala Thr Asn Val Ala He Asn 
690 695 700 

He Arg Pro Pro Val Ala Pro Lys Pro Thr Leu Pro Lys Ala Ser Thr 
705 710 715 720 

Ser Ser Gin Ser Thr Thr Tyr Pro He Lys Ser Ala Ser He Lys Pro 
725 730 735 

Thr Thr Ser Gly Ser Ser He Thr Ala Pro Leu Ser Pro Val Gly Asn 
740 745 750 

Lys Ser Thr Pro Glu He Ser Leu Pro Glu Ser Thr Gin Ser Asn Ser 
755 760 765 

Ser Ser Ala He Ser Thr Asn Leu Gin Lys Lys Ser Phe Thr Leu Tyr 
770 775 780 

Arg Ala Asp Asn Arg Ser Phe Glu Asp Met Gin Ser Lys Phe Pro Glu 
785 790 795 800 

Gly Phe Lys Ala Trp Thr Pro Leu Asp Thr Lys Met Ala Arg Gin Phe 
805 810 815 
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Ala Ser Val Phe lie Gly Gin Lys Asp Thr Ser Asn Leu Pro Lys Glu 
820 825 830 

Thr Val Lys Asn lie Asn Thr Trp Gly Thr Lys Pro Lys Leu Asn Asp 
835 840 845 

Leu Ser Thr Tyr lie Lys Tyr Thr Lys Asp Lys Ser Thr Val Trp Val 
850 855 860 

Ser Thr Ala lie Asn Thr Glu Ala Gly Gly Gin Ser Ser Gly Ala Pro 
865 870 875 880 

Leu His Glu lie Asn Met Asp Leu Tyr Glu Phe Thr lie Asp Gly Gin 
885 890 895 

Lys Leu Asn Pro Leu Pro Arg Gly Arg Ser Lys Asp Arg Val Pro Ser 
900 905 910 

Leu Leu Leu Asp Thr Pro Glu lie Glu Thr Ala Ser He He Ala Leu 
915 920 925 

Asn His Gly Pro Val Asn Asp Ala Glu Val Ser Phe Leu Thr Thr He 
930 935 940 

Pro Leu Lys Asn Val Lys Pro Tyr Lys Arg 
945 950 



<210> 13 
<211> 2522 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 13 

Met He Leu Lys Gly He Asn Met Asn Ser Pro Val Lys Glu He Pro 
15 10 15 

Asp Val Leu Lys He Gin Cys Gly Phe Gin Cys Leu Thr Asp He Ser 
20 25 30 

His Ser Ser Phe Asn Glu Phe His Gin Gin Val Ser Glu His Leu Ser 
35 40 45 

Trp Ser Glu Ala His Asp Leu Tyr His Asp Ala Gin Gin Ala Gin Lys 
50 55 60 

Asp Asn Arg Leu Tyr Glu Ala Arg He Leu Lys Arg Thr Asn Pro Gin 
65 70 75 80 

Leu Gin Asn Ala Val His Leu Ala He Val Ala Pro Asn Ala Glu Leu 
85 90 95 

He Gly Tyr Asn Asn Gin Phe Ser Gly Arg Ala Ser Gin Tyr Val Ala 
100 105 110 

Pro Gly Thr Val Ser Ser Met Phe Ser Pro Ala Ala Tyr Leu Thr Glu 
115 120 125 

Leu Tyr Arg Glu Ala Arg Asn Leu His Ala Ser Asp Ser Val Tyr Arg 
130 135 140 

Leu Asp Thr Arg Arg Pro Asp Leu Lys Ser Met Ala Leu Ser Gin Gin 
145 150 155 160 
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Asn Met Asp Thr Glu Leu Ser Thr Leu Ser Leu Ser Asn Glu Leu Leu 
165 170 175 

Leu Glu Ser He Lys Thr Glu Ser Lys Leu Asp Asn Tyr Thr Gin Val 
180 185 190 

Met Glu Met Leu Ser Ala Phe Arg Pro Ser Gly Ala Ihr Pro Tyr His 
195 200 205 

Asp Ala Tyr Glu Asn Val Arg Lys Val He Gin Leu Gin Asp Pro Gly 
210 215 220 

Leu Glu Gin Leu Asn Ala Ser Pro Ala He Ala Gly Leu Met His Gin 
225 230 235 240 

Ala Ser Leu Leu Gly He Asn Ala Ser He Ser Pro Glu Leu Phe Asn 
245 250 255 

He Leu Thr Glu Glu He Thr Glu Gly Asn Ala Glu Glu Leu Tyr Lys 
260 265 270 

Lys Asn Phe Gly Asn He Glu Pro Ala Ser Leu Ala Met Pro Glu Tyr 
275 280 285 

Leu Arg Arg Tyr tyr Asn Leu Ser Asp Glu Glu Leu Ser Gin Phe He 
290 295 300 

Gly Lys Ala Ser Asn Phe Gly Gin Gin Glu Tyr Ser Asn Asn Gin Leu 
305 310 315 320 

He Thr Pro He Val Asn Ser Asn Asp Gly Ihr Val Lys Val Tyr Arg 
325 330 335 

He Thr Arg Glu Tyr Thr Thr Asn Ala Asn Gin Val Asp Val Glu Leu 
340 345 350 

Phe Pro Tyr Gly Gly Glu Asn Tyr Gin Leu Asn Tyr Lys Phe Lys Asp 
355 360 365 

Ser Arg Gin Asp Val Ser Tyr Leu Ser He Lys Leu Asn Asp Lys Arg 
370 375 380 

Glu Leu He Arg He Glu Gly Ala Pro Gin Val Asn He Glu Tyr Ser 
385 390 395 400 

Glu His He Thr Leu Ser Thr Thr Asp He Ser Gin Pro Phe Glu He 
405 410 415 

Gly Leu Thr Arg Val Tyr Pro Ser Ser Ser Trp Ala Tyr Ala Ala Ala 
420 425 430 

Lys Phe Thr He Glu Glu Tyr Asn Gin Tyr Ser Phe Leu Leu Lys Leu 
435 440 445 

Asn Lys Ala He Arg Leu Ser Arg Ala Thr Glu Leu Ser Pro Thr He 
450 455 460 

Leu Glu Ser He Val Arg Ser Val Asn Gin Gin Leu Asp He Asn Ala 
465 470 475 480 

Glu Val Leu Gly Lys Val Phe Leu Thr Lys Tyr Tyr Met Gin Arg Tyr 
485 490 495 

Ala He Asn Ala Glu Thr Ala Leu He Leu Cys Asn Ala Leu He Ser 
500 505 510 
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Gln Arg Ser Tyr Asp Asn Gin Pro Ser Gin Phe Asp Arg Leu Phe Asn 
515 520 , 525 

Thr Pro Leu Leu Asn Gly Gin Tyr Phe Ser Thr Gly Asp Glu Glu He 
530 535 540 

Asp Leu Asn Pro Gly Ser Thr Gly Asp Trp Arg Lys Ser Val Leu Lys 
545 550 555 560 

Arg Ala Phe Asn lie Asp Asp lie Ser Leu Tyr Arg Leu Leu Lys He 
565 570 575 

Thr Asn His Asn Asn Gin Asp Gly Lys He Lys Asn Asn Leu Asn Asn 
580 585 590 

Leu Ser Asp Leu Tyr He Gly Lys Leu Leu Ala Glu He His Gin Leu 
595 600 605 

Ihr He Asp Glu Leu Asp Leu Leu Leu Val Ala Val Gly Glu Gly Glu 
610 615 620 

Ihr Asn Leu Ser Ala He Ser Asp Lys Gin Leu Ala Ala Leu He Arg 
625 630 635 640 

Lys Leu Asn Ihr He Thr Val Trp Leu Gin Thr Gin Lys Trp Ser Ala 
645 650 655 

Phe Gin Leu Phe Val Met Thr Ser Thr Ser Tyr Asn Lys Thr Leu Thr 
660 665 670 

Pro Glu He Lys Asn Leu Leu Asp Thr Val Tyr His Gly Leu Gin Gly 
675 680 685 

Phe Asp Lys Asp Lys Ala Asn Leu Leu His Val Met Ala Pro Tyr He 
690 695 700 

Ala Ala Thr Leu Gin Leu Ser Ser Glu Asn Val Ala His Ser Val Leu 
705 710 715 720 

Leu Trp Ala Asp Lys Leu Lys Pro Gly Asp Gly Ala Met Ihr Ala Glu 
725 730 735 

Lys Phe Trp Asp Trp Leu Asn Thr Gin Tyr Thr Pro Asp Ser Ser Glu 
740 745 750 

Val Leu Ala Thr Gin Glu His He Val Gin Tyr Cys Gin Ala Leu Ala 
755 760 765 

Gin Leu Glu Met Val Tyr His Ser Thr Gly He Asn Glu Asn Ala Phe 
770 775 780 

Arg Leu Phe Val Thr Lys Pro Glu Met Phe Gly Ser Ser Thr Glu Ala 
785 790 795 BOO 

Val Pro Ala His Asp Ala Leu Ser Leu He Met Leu Thr Arg Phe Ala 
805 810 815 

Asp Trp Val Asn Ala Leu Gly Glu Lys Ala Ser Ser Val Leu Ala Ala 
820 825 830 



Phe Glu Ala Asn Ser Leu Thr Ala Glu Gin Leu Ala Asp Ala Met Asn 
835 840 845 

Leu Asp Ala Asn Leu Leu Leu Gin Ala Ser Thr Gin Ala Gin Asn His 
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850 



855 



860 



Gin His Leu Pro Pro Val Thr Gin Lys Asn Ala Phe Ser Cys Trp ihr 
865 870 875 880 

Ser lie Asp Ihr He Leu Gin Trp Val Asn Val Ala Gin Gin Leu Asn 
885 890 895 

Val Ala Pro Gin Gly Val Ser Ala Leu Val Gly Leu Asp Tyr He Gin 
900 905 910 

Leu Asn Gin Lys He Pro Ihr Tyr Ala Gin Trp Glu Ser Ala Gly Glu 
915 920 925 

He Leu Ihr Ala Gly Leu Asn Ser Gin Gin Ala Asp He Leu His Ala 
930 935 940 

Phe Leu Asp Glu Ser Arg Ser Ala Ala Leu Ser Ihr Tyr Tyr He Arg 
945 950 955 960 

Gin Val Ala Lys Pro Ala Ala Ala He Lys Ser Arg Asp Asp Leu Tyr 
965 970 975 

Gin Tyr Leu Leu lie Asp Asn Gin Val Ser Ala Ala He Lys Ihr Thr 
980 985 990 

Arg He Ala Glu Ala He Ala Ser He Gin Leu Tyr Val Asn Arg Thr 
995 1000 1005 

Leu Glu Asn Val Glu Glu Asn Ala His Ser Gly Val He Ser Arg Gin 
1010 1015 1020 

Phe Phe He Asp Trp Asp Lys Tyr Asn Lys Arg Tyr Ser Thr Trp Ala 
025 1030 1035 1040 

Gly Val Ser Gin Leu Val Tyr Tyr Pro Glu Asn Tyr He Asp Pro Ihr 
1045 1050 1055 

Met Arg He Gly Gin Thr Lys Met Met Asp Ala Leu Leu Gin Ser Val 
1060 1065 1070 

Ser Gin Ser Gin Leu Asn Ala Asp Thr Val Glu Asp Ala Phe Met Ser 
1075 1080 1085 

Tyr Leu Thr Ser Phe Glu Gin Val Ala Asn Leu Lys Val He Ser Ala 
1090 1095 1100 

Tyr His Asp Asn He Asn Asn Asp Gin Gly Leu Thr Tyr Phe He Gly 
105 1110 1115 1120 

Leu Ser Glu Thr Asp Thr Gly Glu Tyr Tyr Trp Arg Ser Val Asp His 
1125 1130 1135 

Ser Lys Phe Ser Asp Gly Lys Phe Ala Ala Asn Ala Trp Ser Glu Trp 
1140 1145 1150 

His Lys He Asp Cys Pro He Asn Pro Tyr Arg Ser Thr He Arg Pro 
1155 1160 1165 

Val Met Tyr Lys Ser Arg Leu Tyr Leu Leu Trp Leu Glu Gin Lys Glu 
1170 1175 1180 

He Thr Lys Gin Thr Gly Asn Ser Lys Asp Gly Tyr Gin Thr Glu Thr 
185 1190 1195 1200 
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Asp TVr Arg iyr Glu Leu Lys Leu Ala His lie Arg Tyr Asp Gly Thr 
1205 1210 1215 

Trp Asn Thr Pro He Thr Phe Asp Val Asn Glu Lys He Ser Lys Leu 
1220 1225 1230 

Glu Leu Ala lys Asn Lys Ala Pro Gly Leu Tyr Cys Ala Gly Tyr Gin 
1235 1240 1245 

Gly Glu Asp Thr Leu Leu Val Met Phe Tyr Asn Gin Gin Asp Ihr Leu 
1250 1255 1260 

Asp Ser Tyr Lys Ihr Ala Ser Met Gin Gly Leu Tyr He Phe Ala Asp 
265 1270 1275 1280 

Met Glu Tyr Lys Asp Met Thr Asp Gly Gin Tyr Lys Ser Tyr Arg Asp 
1285 1290 1295 

Asn Ser Tyr Lys Gin Phe Asp Thr Asn Ser Val Arg Arg Val Asn Asn 
1300 1305 1310 

Arg Tyr Ala Glu Asp Tyr Glu He Pro Ser Ser Val Asn Ser Arg Lys 
1315 1320 1325 

Gly Tyr Asp Trp Gly Asp Tyr Tyr Leu Ser Met Val Tyr Asn Gly Asp 
1330 1335 1340 

He Pro Thr He Ser Tyr Lys Ala Thr Ser Ser Asp Leu Lys He Tyr 
345 1350 1355 1360 

He Ser Pro Lys Leu Arg He He His Asn Gly Tyr Glu Gly Gin Gin 
1365 1370 1375 

Arg Asn Gin Cys Asn Leu Met Asn Lys Tyr Gly Lys Leu Gly Asp Lys 
1380 1385 1390 

Phe He Val Tyr Thr Ser Leu Gly Val Asn Pro Asn Asn Ser Ser Asn 
1395 ~ 1400 1405 

Lys Leu Met Phe Tyr Pro Val Tyr Gin Tyr Asn Gly Asn Val Ser Gly 
1410 1415 1420 

Leu Ser Gin Gly Arg Leu Leu Phe His Arg Asp Thr Asn Tyr Ser Ser 
425 1430 1435 1440 

Lys Val Glu Ala Trp He Pro Gly Ala Gly Arg Ser Leu Thr Asn Pro 
1445 1450 1455 

Asn Ala Ala He Gly Asp Asp Tyr Ala Thr Asp Ser Leu Asn Lys Pro 
1460 1465 1470 

Asn Asp Leu Lys Gin Tyr Val Tyr Met Thr Asp Ser Lys Gly Thr Ala 
1475 1480 1485 

Thr Asp Val Ser Gly Pro Val Asp He Asn Thr Ala He Ser Pro Ala 
1490 1495 1500 

Lys Val Gin Val Thr Val Lys Ala Gly Ser Lys Glu Gin Thr Phe Thr 
505 1510 1515 1520 

Ala Asp Lys Asn Val Ser lie Gin Pro Ser Pro Ser Phe Asp Glu Met 
1525 1530 1535 

Asn Tyr Gin Phe Asn Ala Leu Glu He Asp Gly Ser Ser Leu Asn Phe 
1540 1545 1550 
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Thr Asn Asn Ser Ala Ser He Asp He Thr Phe Thr Ala Phe Ala Glu 
1555 1560 1565 

Asp Gly Arg Lys Leu Gly Tyr Glu Ser Phe Ser He Pro He Thr Arg 
1570 1575 1580 

Lys Val Ser Thr Asp Asn Ser Leu Thr Leu Arg His Asn Glu Asn Gly 
585 1590 1595 1600 

Ala Gin Tyr Met Gin Trp Gly Val Tyr Arg He Arg Leu Asn Thr Leu 
1605 1610 1615 

Phe Ala Arg Gin Leu Val Ala Arg Ala Thr Thr Gly He Asp Thr He 
1620 1625 1630 

Leu Ser Met Glu Thr Gin Asn He Gin Glu Pro Gin Leu Gly Lys Gly 
1635 1640 1645 

Phe Tyr Ala Thr Phe Val He Pro Pro Tyr Asn Pro Ser Thr His Gly 
1650 1655 1660 

Asp Glu Arg Trp Phe Lys Leu Tyr He Lys His Val Val Asp Asn Asn 
665 ~ 1670 1675 1680 

Ser His He He Tyr Ser Gly Gin Leu Lys Asp Thr Asn He Ser Thr 
1685 1690 1695 

Thr Leu Phe He Pro Leu Asp Asp Val Pro Leu Asn Gin Asp Tyr Ser 
1700 1705 1710 

Ala Lys Val Tyr Met Thr Phe Lys Lys Ser Pro Ser Asp Gly Thr Trp 
1715 1720 1725 

Trp Gly Pro His Phe Val Arg Asp Asp Lys Gly He Val Thr He Asn 
1730 1735 1740 

Pro Lys Ser He Leu Thr His Phe Glu Ser Val Asn Val Leu Asn Asn 
745 1750 1755 1760 

He Ser Ser Glu Pro Met Asp Phe Ser Gly Ala Asn Ser Leu Tyr Phe 
1765 1770 1775 

Trp Glu Leu Phe Tyr Tyr Thr Pro Met Leu Val Ala Gin Arg Leu Leu 
1780 1785 1790 

His Glu Gin Asn Phe Asp Glu Ala Asn Arg Trp Leu Lys Tyr Val Trp 
1795 1800 1805 

Ser Pro Ser Gly Tyr He Val His Gly Gin He Gin Asn Tyr Gin Trp 
1810 ' 1815 1820 



Asn Val Arg Pro Leu Leu Glu Asp Thr Ser Trp Asn Ser Asp Pro Leu 
825 1830 1835 1840 

Asp Ser Val Asp Pro Asp Ala Val Ala Gin His Asp Pro Met His Tyr 
1845 1850 1855 

Lys Val Ser Thr Phe Met Arg Thr Leu Asp Leu Leu He Ala Arg Gly 
1860 1865 1870 

Asp His Ala Tyr Arg Gin Leu Glu Arg Asp Thr Leu Asn Glu Ala Lys 
1875 1880 1885 

Met Trp Tyr Met Gin Ala Leu His Leu Leu Gly Asp Lys Pro Tyr Leu 
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1890 



1895 



1900 



Pro Leu Ser Thr Thr Trp Asn Asp Pro Arg Leu Asp Lys Ala Ala Asp 
905 1910 1915 1920 

He Thr Thr Gin Ser Ala His Ser Ser Ser He Val Ala leu Arg Gin 
1925 1930 1935 

Ser Thr Pro Ala Leu Leu Ser Leu Arg Ser Ala Asn Thr Leu Thr Asp 
1940 1945 1950 

Leu Phe Leu Pro Gin He Asn Glu Val Met Met Asn Tyr Trp Gin Thr 
1955 I960 1965 

Leu Ala Gin Arg Val Tyr Asn Leu Arg His Asn Leu Ser lie Asp Gly 
1970 1975 1980 

Gin Pro Leu Tyr Leu Pro lie Tyr Ala Thr Pro Ala Asp Pro Lys Ala 
985 1990 1995 2000 

Leu Leu Ser Ala Ala Val Ala Thr Ser Gin Gly Gly Gly Lys leu Pro 
2005 2010 2015 

Glu Ser Phe Met Ser Leu Trp Arg Phe Pro His Met Leu Glu Asn Ala 
2020 2025 2030 

Arg Ser Met Val Ser Gin Leu Thr Gin Phe Gly Ser Thr Leu Gin Asn 
2035 2040 2045 

He He Glu Arg Gin Asp Ala Glu Ala Leu Asn Ala leu Leu Gin Asn 
2050 2055 2060 

Gin Ala Ala Glu Leu He Leu Thr Asn Leu Ser He Gin Asp Lys Thr 
065 2070 2075 2080 

He Glu Glu Leu Asp Ala Glu Lys Thr Val Leu Glu Lys Ser Lys Ala 
2085 2090 2095 

Gly Ala Gin Ser Arg Phe Asp Ser Tyr Ser Lys Leu His Asp Glu Asn 
2100 2105 2110 

He Asn Ala Gly Glu Asn Gin Ala Met Thr Leu Arg Ala Ser Ala Ala 
2115 2120 2125 

Gly Leu Thr Thr Ala Val Gin Ala Ser Arg Leu Ala Gly Ala Ala Ala 
2130 2135 2140 

Asp Leu Val Pro Asn He Phe Gly Phe Ala Gly Gly Gly Ser Arg Trp 
145 2150 2155 2160 

Gly Ala He Ala Glu Ala Thr Gly Tyr Val Met Glu Phe Ser Ala Asn 
2165 2170 2175 

Val Met Asn Thr Glu Ala Asp Lys He Ser Gin Ser Glu Thr Tyr Arg 
2180 2185 2190 

Arg Arg Arg Gin Glu Trp Glu He Gin Arg Asn Asn Ala Glu Ala Glu 
2195 2200 2205 

Leu Lys Gin Leu Asp Ala Gin Leu Lys Ser Leu Ala Val Arg Arg Glu 
2210 2215 2220 

Ala Ala Val leu Gin Lys Thr Ser leu Lys Thr Gin Gin Glu Gin Thr 
225 2230 2235 2240 
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Gln Ala Gin Leu Ala Phe Leu Gin Arg Lys Phe Ser Asn Gin Ala Leu 
2245 2250 2255 

Tyr Asn Trp Leu Arg Gly Arg Leu Ala Ala He Tyr Phe Gin Phe Tyr 
2260 2265 2270 

Asp Leu Ala He Ala Arg Cys Leu Met Ala Glu Gin Ala Tyr Arg Trp 
2275 2280 2285 

Glu He Ser Asp Asp Ser Ala Arg Phe He Lys Pro Gly Ala Trp Gin 
2290 2295 2300 

Gly Thr Tyr Ala Gly Leu Leu Ala Gly Glu Thr Leu Met Leu Ser Leu 
305 ' 2310 2315 2320 

Ala Gin Met Glu Asp Ala His Leu Arg Arg Asp Lys Arg Ala Leu Glu 
2325 2330 2335 

Val Glu Arg Thr Val Ser Leu Ala Glu He iyr Ala Gly Leu Pro Gin 
2340 2345 2350 

Asp Lys Gly Pro Phe Ser Leu Thr Gin Glu He Glu Lys Leu Val Asn 
2355 2360 2365 

Ala Gly Ser Gly Ser Ala Gly Ser Gly Asn Asn Asn Leu Ala the Gly 
2370 2375 2380 

Ala Gly Ihr Asp Thr Lys Thr Ser Leu Gin Ala Ser He Ser Leu Ala 
385 2390 2395 2400 

Asp Leu Lys He Arg Glu Asp Tyr Pro Glu Ser He Gly Lys He Arg 
2405 2410 2415 

Arg He Lys Gin He Ser Val Thr Leu Pro Ala Leu Leu Gly Pro Tyr 
2420 2425 2430 

Gin Asp Val Gin Ala He Leu Ser Tyr Gly Asp Lys Ala Gly Leu Ala 
2435 2440 2445 

Asn Gly Cys Ala Ala Leu Ala Val Ser His Gly Thr Asn Asp Ser Gly 
2450 2455 2460 

Gin Phe Gin Leu Asp Phe Asn Asp Gly lys Phe Leu Pro Phe Glu Gly 
465 2470 2475 2480 

He Ala He Asp Gin Gly Thr Leu Thr Leu Ser Phe Pro Asn Ala Ser 
2485 2490 2495 

Thr Pro Ala Lys Gly Lys Gin Ala Thr Met Leu Lys Thr Leu Asn Asp 
2500 2505 2510 

He He Leu His He Arg Tyr Thr He Lys 



<210> 14 
<211> 1481 
<212> PRT 

<213> Photorhabdus luminescens 
<400> 14 

Met Gin Asn Ser Gin Thr Phe Ser Met Thr Glu Leu Ser Leu Pro Lys 
15 10 15 



2515 



2520 



Gly Gly Gly Ala He Thr Gly Met Gly Glu Ala Leu Thr Pro Ala Gly 
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20 



25 



30 



Pro Asp Gly Met Ala Ala Lai Ser Leu Pro Leu Pro He Ser Ala Gly 
35 40 45 

Arg Gly Tyr Ala Pro Ser Leu Thr Leu Asn Tyr Asn Ser Gly Thr Gly 
50 55 60 

Asn Ser Pro Phe Gly Leu Gly Trp Asp Cys Asn Val Met Thr He Arg 
65 70 75 80 

Arg Arg Thr Ser Thr Gly Val Pro Asn Tyr Asp Glu Thr Asp Thr Phe 
85 90 95 

Leu Gly Pro Glu Gly Glu Val Leu Val Val Ala Leu Asn Glu Ala Gly 
100 105 110 

Gin Ala Asp He Arg Ser Glu Ser Ser Leu Gin Gly He Asn Leu Gly 
115 120 125 

Met Thr Phe Thr Val Thr Gly Tyr Arg Ser Arg Leu Glu Ser His Phe 
130 135 " 140 

Ser Arg Leu Glu Tyr Trp Gin Pro Gin Thr Thr Gly Ala Thr Asp Phe 
145 150 155 160 

Trp Leu He Tyr Ser Pro Asp Gly Gin Ala His Leu Leu Gly Lys Asn 
165 170 175 

Pro Gin Ala Arg He Ser Asn Pro Leu Asn Val Asn Gin Thr Ala Gin 
180 185 190 

Trp Leu Leu Glu Ala Ser Val Ser Ser His Gly Glu Gin He Tyr Tyr 
195 200 205 

Gin Tyr Arg Ala Glu Asp Glu Thr Asp Cys Glu Thr Asp Glu Leu Thr 
210 215 220 

Ala His Pro Asn Thr Thr Val Gin Arg Tyr Leu Gin Val Val His Tyr 
225 230 235 240 

Gly Asn Leu Thr Ala Ser Glu Val Phe Pro Thr Leu Asn Gly Asp Asp 
245 250 255 

Pro Leu Lys Ser Gly Trp Leu Phe cys Leu Val Phe Asp Tyr Gly Glu 
260 265 270 

Arg Lys Asn Ser Leu Ser Glu Met Pro Pro Phe Lys Ala Thr Ser Asn 
275 280 285 

Trp Leu cys Arg Lys Asp Arg Phe Ser Arg Tyr Glu Tyr Gly Phe Ala 
290 295 300 

Leu Arg Thr Arg Arg Leu Cys Arg Gin He Leu Met Phe His Arg Leu 
305 310 315 320 

Gin Thr Leu Ser Gly. Gin Ala Lys Gly Asp Asp Glu Pro Ala Leu Val 
325 330 335 

Ser Arg Leu He Leu Asp Tyr Asp Glu Asn Ala Val Val Ser Thr Leu 
340 345 350 



Val Ser Val Arg Arg Val Gly His Glu Gin Asp Gly Thr Thr Ala Val 
355 360 365 
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Ala Leu Pro Pro Leu Glu Leu Ala Tyr Gin Pro Phe Glu Pro Glu Gin 
370 375 380 

Lys Ala Leu Trp Arg Pro Met Asp Val Leu Ala Asn Phe Asn Thr He 
385 390 395 400 

Gin Arg Trp Gin Leu Leu Asp Leu Gin Gly Glu Gly Val Pro Gly He 
405 410 415 

Leu Tyr Gin Asp Lys Asn Gly Trp Trp Tyr Arg Ser Ala Gin Arg Gin 
420 . 425 430 

Thr Gly Glu Glu Met Asn Ala Val Thr Trp Gly Lys Met Gin Leu Leu 
435 440 445 

Pro He Thr Pro Ala He Gin Asp Asn Ala Ser Leu Met Asp He Asn 
450 455 A 460 

Gly Asp Gly Gin Leu Asp Trp Val He Thr Gly Pro Gly Leu Arg Gly 
465 470 475 480 

Tyr His Ser Gin His Pro Asp Gly Ser Trp Thr Arg Phe Ihr Pro Leu 
485 490 495 

His Ala Leu Pro He Glu Tyr Thr His Pro Arg Ala Gin Leu Ala Asp 
500 505 510 

Leu Met Gly Ala Gly Leu Ser Asp Leu Val Leu He Gly Pro Lys Ser 
515 520 525 

Val Arg Leu Tyr Ala Asn Asn Arg Asp Gly Phe Thr Glu Gly Arg Asp 
530 535 540 

Val Val Gin Ser Gly Gly He Thr Leu Pro Leu Pro Gly Ala Asp Ala 
545 550 555 560 

Arg lys Leu Val Ala Phe Ser Asp Val Leu Gly Ser Gly Gin Ala His 
565 570 575 

Leu Val Glu Val Ser Ala Thr Lys Val Ihr Cys Trp Pro Asn Leu Gly 
580 585 590 

His Gly Arg Phe Gly Gin Pro He Thr Leu Pro Gly Phe Ser Gin Ser 
595 600 605 

Ala Ala Asn Phe Asn Pro Asp Arg Val His Leu Ala Asp Leu Asp Gly 
610 615 620 ' 

Ser Gly Pro Ala Asp Leu He Tyr Val His Ala Asp His Leu Asp He 
625 630 635 640 

Phe Ser Asn Glu Ser Gly Asn Gly Phe Ala Gin Pro Phe Ihr Leu Arg 
645 650 655 

Phe Pro Asp Gly Leu Arg Phe Asp Asp Thr Cys Gin Leu Gin Val Ala 
660 665 670 

Asp Val Gin Gly Leu Gly Val Val Ser Leu He Leu Ser Val Pro His 
675 680 685 

Met Ala Pro His His Trp Arg Cys Asp Leu Ihr Asn Ala Lys Pro Trp 
690 695 700 

Leu Leu Ser Glu Met Asn Asn Asn Met Gly Ala His His Ihr Leu His 
705 710 715 720 
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TVr Arg Ser Ser Val Gin Phe Trp Leu Asp Glu Lys Ala Ala Ala leu 
725 730 735 

Ala Thr Gly Gin Thr Pro Val Cys Tyr Leu Pro Phe Pro Val His Thr 
740 745 750 

Leu Trp Gin Ihr Glu Thr Glu Asp Glu lie Ser Gly Asn Lys Leu Val 
755 760 765 

Ihr Ihr Leu Arg iyr Ala His Gly Ala Trp Asp Gly Arg Glu Arg Glu 
770 775 780 

Phe Arg Gly Phe Gly Tyr Val Glu Gin Ihr Asp Ser His Gin Leu Ala 
785 790 795 800 

Gin Gly Asn Ala Pro Glu Arg Thr Ser Pro Ala Leu Thr Lys Asn Trp 
805 810 815 

Tyr Ala Ihr Gly lie Pro Glu Val Asp Asn Thr Leu Ser Ala Gly Tyr 
820 825 830 

Trp Arg Gly Asp Ihr Gin Ala Phe Ihr Gly Ehe Thr Pro His Fhe Thr 
835 840 . 845 

Leu Trp Lys Glu Gly Lys Asp Val Pro Leu Ihr Pro Glu Asp Asp His 
850 855 860 

Asn Leu iyr Trp Leu Asn Arg Ala Leu Lys Gly Gin Pro Leu Arg Ser 
865 " 870 875 880 

Glu Leu Tyr Gly Leu Asp Gly Ser Ala Gin Gin Lys lie Pro Tyr Ihr 
885 890 895 

Val Thr Glu Ser Arg Pro Gin Val Arg Gin Leu Gin Asp Asn Thr Thr 
900 905 910 

Leu Ser Pro Val Leu Trp Ala Ser Val Val Glu Ser Arg Ser Tyr His 
915 920 925 

Tyr Glu Arg He He Ser Asp Pro Gin Cys Asn Gin Asp He Ihr Leu 
930 935 940 

Ser Ser Asp Leu Phe Gly Gin Pro Leu Lys Gin Val Ser Val Gin Tyr 
945 950 955 960 

Pro Arg Arg Asn Lys Pro Thr Thr Asn Pro Tyr Pro Asp Thr Leu Pro 
965 970 975 

Asp Thr Leu Phe Ala Ser Ser Tyr Asp Asp Gin Gin Gin leu Leu Arg 
980 985 990 

Leu Thr Tyr Gin Gin Ser Ser Trp His His Leu He Ala Asn Glu Leu 
995 1000 1005 

Arg Val Leu Gly Leu Pro Asp Gly Thr Arg Ser Asp Ala Phe Thr Tyr 
1010 ^ 1015 1020 

Asp Ala Lys His Val Pro Val Asp Gly Leu Asn leu Glu Ala Leu Cys 
025 1030 1035 1040 

Ala Glu Asn Ser Leu He Ala Asp Asp Lys Pro Arg Glu Tyr Leu Asn 
1045 1050 1055 

Gin Gin Arg Thr Phe Tyr Thr Asp Gly Lys Thr Asp Gly Lys Asn Pro 
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1060 1065 1070 

Thr Pro Leu Lys Thr Pro Thr Arg Gin Ala Leu He Ala Phe Thr Glu 
1075 1080 1085 

Thr Ala Val Leu Thr Glu Ser Leu Leu Ser Ala Phe Asp Gly Gly He 
1090 1095 1100 

Thr Pro Asp Glu Leu Pro Gly Leu Leu Ihr Gin Ala Gly Tyr Gin Gin 
105 1110 1115 1120 

Glu Pro Tyr Leu Phe Pro Leu Ser Gly Glu Asn Gin Val Trp Val Ala 
1125 1130 1135 

Arg Lys Gly Tyr Ihr Asp Tyr Gly Ihr Glu Val Gin Phe Trp Arg Pro 
1140 1145 1150 

Val Ala Gin Arg Asn Ihr Gin Leu Ihr Gly Lys Thr Thr Leu Lys Trp 
1155 1160 1165 

Asp Ihr His Tyr Cys Val He Thr Gin Thr Gin Asp Ala Ala Gly Leu 
1170 1175 1180 

Thr Val Ser Ala Asn Tyr Asp Trp Arg Phe Leu Thr Pro Met Gin Leu 
185 1190 1195 1200 

Thr Asp He Asn Asp Asn Val His He He Thr Leu Asp Ala Leu Gly 
1205 1210 1215 

Arg Pro Val Thr Gin Arg Ehe Trp Gly He Glu Asn Gly Val Ala Thr 
1220 1225 1230 

Gly Tyr Ser Ser Pro Glu Ala Lys Pro Ehe Thr Pro Pro Val Asp Val 
1235 1240 1245 

Asn Ala Ala He Ala Leu Thr Gly Pro Leu Pro Val Ala Gin Cys Leu 
1250 1255 1260 

Val Tyr Ala Pro Asp Ser Trp Met Pro Leu Phe Gly Gin Glu Thr Phe 
265 1270 1275 1280 

Asn Thr Leu Thr Gin Glu Glu Gin lys Thr Leu Arg Asp Leu Arg He 
1285 1290 1295 

He Thr Glu Asp Trp Arg He Cys Ala Leu Ala Arg Arg Arg Trp Leu 
1300 1305 1310 

Gin Ser Gin Lys Ala Gly Thr Pro Leu Val Lys Leu Leu Thr Asn Ser 
1315 1320 1325 

He Gly Leu Pro Pro His Asn Leu Met Leu Ala Thr Asp Arg Tyr Asp 
1330 1335 1340 

Arg Asp Ser Glu Gin Gin He Arg Gin Gin Val Ala Phe Ser Asp Gly 
345 1350 1355 1360 

Phe Gly Arg Leu Leu Gin Ala Ala Val Arg His Glu Ala Gly Glu Ala 
1365 1370 1375 

Trp Gin Arg Asn Gin Asp Gly Ser Leu Val Thr Lys Met Glu Asp Thr 
1380 1385 1390 

Lys Thr Arg Trp Ala He Thr Gly Arg Thr Glu Tyr Asp Asn Lys Gly 
1395 1400 1405 
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Gin Ala lie Arg Thr Tyr Gin Pro Tyr Phe Leu Asn Asp Trp Arg Tyr 
1410 1415 1420 

Val Ser Asp Asp Ser Ala Arg Lys Glu Ala Tyr Ala Asp Thr His He 
425 * 1430 1435 1440 

iyr Asp Pro He Gly Arg Glu He Gin Val He Thr Ala Lys Gly Trp 
1445 1450 1455 

Leu Arg Gin Asn Gin Tyr Phe Pro Trp Phe Thr Val Ser Glu Asp Glu 
1460 1465 1470 

Asn Asp Leu Ser Ala Asp Ala Leu Val 
1475 1480 



<210> 15 
<211> 23 
<212> ENA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: oligonucleotide 



<400> 15 

cgggatccga tgattttaaa agg 23 



<210> 16 
<211> 16 
<212> nsiA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 16 

gcgccattga tttgag 16 



<210> 17 
<211> 19 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :oligcraicleotide 
<400> 17 

cattagaggt cgaacgtac 19 



<210> 18 . 
<211> 26 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence oligonucleotide 
<400> 18 

gagcgagctc ttacttaatg gtgtag 26 



<210> 19 




WO 99/42589 



PCT/EP99/01015 



-69- 



<211> 28 
<212> ENA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : oligonucleotide 
<400> 19 

cagcgagctc catgcagaat tcacagac 



<210> 20 
<211> 18 
<212> ENA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : oligonucleotide 
<400> 20 

ggcaatggca gcgataag 



<210> 21 
<211> 18 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oUgonucleotide 
<400> 21 

cattaacgca ggaagagc 



<210> 22 
<211> 26 
<212> IWA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 



<400> 22 

gacctcgagt tacacgagcg cgtcag 



26 



